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(54) Tide: ENTERICALLY TRANSMITTED NON-A/NON-B HEPATITIS VIRAL AGENT 



(57) Abstract 



Viral proteins derived from an enterically transmitted non-A/ 
non-B viral hepatitis agent are disclosed. In one embodiment, the pro- 
tein is immunologically reactive with antibodies present in individuals 
infected with the viral hepatitis agent This protein is useful in a diag- 
nostic method for detecting infection by the enterically transmitted 
agent Also disclosed are DNA probes derived from a cloned se- 
quence of the viral agent These probes are useful for identifying and 
sequencing the entire viral agent and for assaying the presence of the 
viral agent in an infected sample, using probe-specific amplification 
of virus-derived DNA fragments. 
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ENTERICALLY TRANSMITTED NON-A/NON-B 
HEPATITIS VIRAL AGENT 



CROSS-REFERENCE TO RELATED APPLICATIONS 
This application is a continuation-in-part of 
U.S. Application Serial No. 336,672, filed April 11, 
1989, which is a continuation-in-part of U.S. 
Application Serial No. 208,997, filed June 17, 1988, 
both of which are herein incorporated by reference. 

15 INTRODUCTION 



Field of Invention 

This invention relates to recombinant 
proteins, genes, and gene probes and more specifically 
to such proteins and probes derived from an enterically 
transmitted nonA/nonB hepatitis viral agent, and to 
diagnostic methods and vaccine applications which 
employ the proteins and probes. 
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25 Background 

Enterically transmitted non-A/non-B (ET-NANB) 
hepatitis viral agent is the reported cause of 
hepatitis in several epidemics and sporadic cases in 
Asia, Africa, and the Indian subcontinent. Infection 

30 is usually by water contaminated with feces, although 
the virus may also spread by close physical contact. 
The virus does not seem to cause chronic infection. 
The viral etiology in ET-NANB has been demonstrated by 
infection of volunteers with pooled fecal isolates; 

35 Immune electron microscopy (IEM) studies have shown 

virus particles with 27-34 nm diameters in stools from 
infected individuals. The virus particles reacted with 
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antibodies in serum from infected individuals from 
geographically distinct regions, suggesting that a 
single viral agent or class is responsible for the 
majority of ET-NANB hepatitis seen worldwide. No 
antibody reaction was seen in serum from individuals 
infected with blood-transmitted NANB virus, indicating 
a different specificity between the two NANB types. 

In addition to serological differences, the 
two types of NANB infection show distinct clinical 
dif-ferences. ET-NANB is characteristically an acute 
infection, often associated with fever and arthralgia, 
and with portal inflammation and associated bile stasis 
in liver biopsy specimens (Arankalle) . Symptoms are 
usually resolved within six weeks. Blood-transmitted 
15 NANB, by contrast, produces a chronic infection in 
about 50% of the cases. Fever and arthralgia are 
rarely seen, and inflammation has a predominantly 
parenchymal distribution (Khuroo, 1980). The two viral 
agents can also be distinguished on the basis of 
20 primate host susceptibility. ET-NANB, but not the 
blood-transmitted agent, can be transmitted to 
cynomolgus monkeys. The blood-transmitted agent is 
more readily transmitted to chimpanzees than is ET-NANB 
(Bradley, 1987). 

25 There have been major efforts worldwide to 

identify and clone viral genomic sequences associated 
with ET-NANB hepatitis. One goal of this effort, 
requiring virus-specific genomic sequences, is to 
identify and characterize the nature of the virus and 

30 its protein products. Another goal is to produce 
recombinant viral proteins which can be used in 
antibody-based diagnostic procedures and for a 
vaccine. Despite these efforts, viral sequences 
associated with ET-NANB hepatitis have not been 

35 successfully identified or cloned heretofore, nor have 
any virus-specific proteins been identified or 
produced. 
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SUMMARY OF THE INVENTION 

25 Novel compositions, as well as methods of 

preparation and use of the compositions are provided, 
where the compositions comprise viral proteins and 
fragments thereof derived from the viral agent for ET- 
NANB. Methods for preparation of ET-NANB viral 

30 proteins include isolating ET-NANB genomic sequences 
which are then cloned and expressed in a host cell. 
The resultant recombinant viral proteins find use as 
diagnostic agents and as vaccines. The genomic 
sequences and fragments thereof find use in preparing 

35 ET-NANB viral proteins and as probes for virus 
detection. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows vector constructions and 
manipulations used in obtaining and sequencing cloned 
ET-NANB fragment; and 
5 Figures 2A-2B are representations of Southern 

blots in which a radiolabeled ET-NANB probe was 
hybridized with amplified cDNA fragments prepared from 
RNA isolated from infected (I) and non-infected (N) 
bile sources (2A), and from infected (I) and non- 
10 infected (N) stool-sample sources (2B). 

DESCRIPTION OF SPECIFIC EMBODIMENTS 
Novel compositions comprising generic 
sequences and fragments thereof derived from the viral 
15 agent for ^ET-NANB are provided, together with 

recombinant viral proteins produced using the genomic 
sequences and methods of using these compositions. 

The genome of the ET-NANB viral agent is 
identified as containing a region which is homologous 
to the 1.33 kb DNA EcoRI insert present in plasmid pTZ- 
KFl (ETl.l) carried in E. coli strain BB4 and having 
ATCC deposit no. 67717. Initial studies sequenced the 
two terminal regions of the insert and an intermediate 
region. The 5 '-end region of this insert contains the 
25 sequence: 



20 



1 42 
GAT GGA AGG CAC TAA TCT GGC AAG ACC TGT CCC TGT TGC AGC 

30 43 

84 

TGT TCT ACC ACC CTG CCC CGA GCT CGA ACA GGG CCT TCT CTA 
85 

126 

CCT GCC CCA GGA GCT CAC ACA CCC TGT GAT AGT GTC GTA ACA 

168 

TTT GAA TTA ACA GAC ATT GTG CAC TGC CGC ATG GCC GCC CCG 
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169 210 
AGC CAG CGC AAG GCC GTG CTG TCC ACA CTC GTG GGC CGC TAC 



5 211 
GGC. 

An intermediate region has the sequence: 

10 691 731 
CTA GAG TGT GCT ATT ATG GAG GAG TGT GGG ATG CCG CAG TGG 

733 774 
CTC ATC CGC CTG TAT CAC CTT ATA AGG TCT GCG TGG ATC TTG 

15 

775 816 

CAG GCC CCG AAG GAG TCT CTG CGA GGG TTT TGG AAG AAA CAC 

817 858 
20 TCC GGT GAG CCC GGC ACT CTT CTA TGG AAT ACT GTC TGG AAT 

859 900 
ATG GCC GTT ATT ACC CAC TGT TAT GAC TTC CGC GAT TTT CAG 

25 901 9 42 
GTG GCT GCC TTT AAA GGT GAT GAT TCG ATA GTG CTT TGC AGT 

9" 984 
GAG TAT CGT CAG AGT CCA GGA GCT GCT GTC CTG ATC GCC GGC 

30 

985 1026 
TGT GGC TTG AAG TTG AAG GTA GAT TTC CGC CCG ATC GGT TTG 



35 



1027 
TAT. 
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The 3' -end region contains the sequence: 

1191 1232 
TGA GTA GAG GAT GTT GTT TCC CGT GTT TAT GGG GTT TCC CCT 

1233 1274 
GGA CTC GTT CAT AAC CTG ATT GGC ATG CTA CAG GCT GTT GCT 

1275 1316 
GAT GGC AAG GCA CAT TTC ACT GAG TCA GTA AAA CCA GTG CTC 

1317 1327 
GAC CGG AAT TC. 

Additional work has provided the entire sequence, in 
both directions, as set forth below. The sequence of 
both strands is provided, since it is not known in 
which strand the gene is located. However, the 
sequence in one direction has been designated as the 
"forward" sequence because of statistical similarities 
to known proteins. This sequence is set forth below 
along with the three possible translation sequences. 
There is one long open reading frame that starts at 
nucleotide 145 with an isoleucine and extends to the 
end of the sequence. The two other reading frames have 
many termination codons. Standard abbreviations for 
nucleotides and- amino acids are used here and elsewhere 
in this, specification. 

Forward Sequence 

RIPPTDGRHZSGKTCPCCSC 
EFRQLMEGTNLARPVPVAAV 
NSA NZWKALIWQDLSLLQLF 

* 

1 11 21 31 41 51 

CGAATTCCGCCAACTGATGGAAGGCACTAATCTGGCAAGACCTGTCCCTGTTGCAGCTGT 
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STTLPRARTGPSLPAPGAHH 

LPPCPELEQGLLYLPQELTT 

YHPAPSSNRAPSTCPRSSPP 
****** 

61 71 81 91 101 in .. 

TCTACCACCCTGCCCCGAGCTCGAACAGGGCCTTCTCTACCTGCCCCAGGAGCTCACCAC 

LZZCRN IZINRHCALPHGRP 
CDSVVT FELTD IVHCRMAAP 

VIVSZHLNZQTLCTAAWPPR 
****** 

121 131 141 151 161 171 

CTGTGATAGTGTCGTAACATTTGAATTAACAGACATTGTGCACTGCCGCATGGCCGCCCC 

EPAQGRAVHTRGPLRRRTKL 

SQRKAV LSTLVGRYGVAQSS 

ASARPCCP HSWAATASHKAL 
****** 

181 191 201 211 221 231 

GAGCCAGCGCAAGGCCGTGCTGTCCACACTCGTGGGCCGCTACGGCGTCGCACAAAGCTC 

YNASHSDVRDSLARFIPAIG 

TMLPTLMPATLSPV LSRPLA 

QCFPLZCSRLSRPFYPGHWP 
* * * * * * 

241 251 261 271 281 291 

TACAATGCTTCCCACTCTGATGTTCGCGACTCTCTCGCCCGTTTTATCCCGGCCATTGGC 

PVQVTTCELYELVEAMVEKG 

PYRLQLVNCTSZWRPWSRRA 

RTGYNLZIVRASGGHGREGP 
****** 

301 311 321 331 341 351 

CCCGTACAGGTTACAACTTGTGAATTGTACGAGCTAGTGGAGGCCATGGTCGAGAAGGGC 
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QDGSAVLELDLCNRDVSRIT 

RMAPPSLSLIFATVTCPGSP 

GWLRRP2A2SLQPZRVQDHL 
* * * * 

361 371 381 391 401 4U 

CAGGATGGCTCCGCCGTCCTTGAGCTTGATCTTTGCAACCGTGACGTGTCCAGGATCACC 

FFQKDCNKPTTGETIAHGKV 
S SR K IVTSSPQVRPLPMVKW 

LPERL ZQVHHRZDHCPWZSG 
* 

421 431 441 451 461 471 

TTCTTCCAGAAAGATTGTAACAAGTTCACCACAGGTGAGACCATTGCCCATGGTAAAGTG 

GQGIS AWSKTPCALPGPWPR 
ARASRPGARPSAPS L A L G S A 

PGHLGLEQDLLRPLWPLV PR 
* * * * * * 

481 491 501 511 521 531 

GGCCAGGGCATCTCGGCCTGGAGCAAGACCTTCTGCGCCCTCTTTGGCCCTTGGTTCCGC 

A I EKAILALLP QGVFYGDAP 

LLRRLPWPCSLR VCFTVMPL 

YZEGXSGPAPSGCV LRZCLZ 
* * * * * 

541 551 561 571 581 591 

GCTATTGAGAAGGCTATTCTGGCCCTGCTCCCTCAGGGTGTGTTTTACGGTGATGCCTTT 

DDTVPS AAVAAAKASMVPEN 
MTPSSRRLWPQQRHPMCLRM 
Z HRLLGGCGRSKG IHG VZ E Z 

* 

601 611 621 631 641 651 

GATGACACCGTCTTCTCGGCGGCTGTGGCCGCAGCAAAGGCATCCATGGTGTTTGAGAAT 
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DPSEFDSTQNNFSLGLECAI 

TFLSLTPPRITFLWVZSVLL 

LFZVZLHPEZLFSGS-RVCYY 
****** 

661 671 681 691 701 711 

GACTTTTCTGAGTTTGACTCCACCCAGAATAACTTTTCTCTGGGTCTAGAGTGTGCTATT 

MEECGMPQWLIRLYHLIRSA 

WRSVGCRSGSSACITLZGLR 

GGVWDAAVAHPPVSPYKVCV 
****** 

721 731 741 751 761 771 

ATGGAGGAGTGTGGGATGCCGCAGTGGCTCATCCGCCTGTATCACCTTATAAGGTCTGCG 

WILQAPKESLRGFWKKHS GE 

GSCRPRRSLCEGFGRNTPVS 

DLAGPEGVSARVLEETLRZA 
* * * * * * 

781 791 801 811 821 831 

TGGATCTTGCAGGCCCCGAAGGAGTCTCTGCGAGGGTTTTGGAAGAAACACTCCGGTGAG 

PGTLLWNTVWNMAVITHCYD 

PALFYGILSGIWPLLPTVMT 

RHSSMEYCLEYGRYYPLLZL 
****** 

841 851 861 871 881 891 

CCCGGCACTCTTCTATGGAATACTGTCTGGAATATGGCCGTTATTACCCACTGTTATGAC 

FRDFQVAAFKGDDSIVLCSE 

SAIFRWLPLKVMIRZCFAVS 

PRFSGGCLZRZZFDSALQZV 
****** 

901 911 921 931 941 951 

TTCCGCGATTTTCAGGTGGCTGCCTTTAAAGGTGATGATTCGATAGTGCTTTGCAGTGAG 
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XRQSPGAAVLIAGCG LKLKV 
IVRVQELLSZSPA. VAZSZRZ 
SSESRSCCPDRRLWLEVEGR 

* * * * * * 

961 971 981 991 1001 1011 

TATCGTCAGAGTCCAGGAGCTGCTGTCCTGATCGCCGGCTGTGGCTTGAAGTTGAAGGTA 

DFRPIGLYAGVVVAPGLGAL 

IS ARSVCMQVLWWPPAL ARS 
FPPDRP VCRCCGGPRPWRAp 
* 

1021 1031 1041 1051 1061 i 0 7i 

GATTTCCGCCCGATCGGTTTGTATGCAGGTGTTGTGGTGGCCCCCGGCCTTGGCGCGCTC 

PDVVRPAGRLTE .K NWGPGPE 
LMLC AS PAG.LPRR I GALALS 

ZCCALRRPAYREELGPWPZA 

* * * * * * 

1081 1091 1101 1U1 U21 H3i 

CCTGATGTTGTGCGCTTCGCCGGCCGGCTTACCGAGAAGAATTGGGGCCCTGGCCCTGAG 

RAEQLRLA. VSDPLRKLTNVA 
GRSSSASLLVISSASSRMZL 

GGAAPPRC ZZPPPQAHEC SS 
* * * * * 

1141 1151. 1161 1171 1181 U9i 

CGGGCGGAGCAGCTCCGCCTCGCTGTTAGTGATTTCCTCCGCAAGCTCACGAATGTAGCT 

QMCVDVVSRVYGVSPGLVHN 

RCVWMLPPVFMGPPLDSPIT 

DVCG CCPPCLWG FPWTRSZP 
* 

1201 1211 1221 1231 1241 1251 

CAGATGTGTGTGGATGTTGTTTCCCGTGTTTATGGGGTTTCCCCTGGACTCGTTCATAAC 
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LIGMLQAVADG KAHFTESVK 
ZLACYRLLLM ARHISLSQZN 

DWHATGCCZWQGTPH2VSKT 

* * * * * * 

1261 1271 1281 1291 1301 1311 

CTGATTGGCATGCTACAGGCTGTTGCTGATGGCAAGGCACATTTCACTGAGTCAGTAAAA 

PVLDRNSS 
Q C S T G I R 

S A R P E F E 

* * * 

1321 1331 1341 

CCAGTGCTCGACCGGAATTCGAGC 

The complimentary strand, referred to here as 
the "reverse sequence," is set forth below in the same 
manner as the forward sequence set forth above. 
Several open reading frames, shorter than the long open 
reading frame found in the forward sequence, dan be 
seen in this reverse sequence. 

Reverse Sequence 

ARIPVEHWFYZLSE MCLAIS 

LEFRSSTGFT DSVKC ALPSA 

SNSGRALVLLTQZNVPCHQQ 
****** 

1 11 21 31 41 51 

GCTCGAATTCCGGTCGAGCACTGGTTTTACTGACTCAGTGAAATGTGCCTTGCCATCAGC 

NSLZHANQVMNESRGNPINT 

TACSMPIRLZTSPGETPZTR 

QPVACQSGYERVQGKPHKHG 
****** 

61 71 81 91 101 HI 

AACAGCCTGTAGCATGCCAATCAGGTTATGAACGAGTCCAGGGGAAACCCCATAAACACG 



WO 89/12462 PCI7US89/02648 

12 

GNNIHTHLSYIRELAEEITN 
ETTSTHIZA. TPV SLRRRSLT 
KQHPHTSELHSZACGGNHZ Q 
* * * * 

121 131 141 151 161 171 

GGAAACAACATCCACACACATCTGAGCTACATTCGTGAGCTTGCGGAGGAAATCACTAAC 

SE AELLRPLRARAPILLGKP 
ARRSCSARSGPGPQPFSVSR 

R GGAA PPAQGQGPNSSRZAG 

* * * * * * 

181 191 201 211 221 231 

AGCGAGGCGGAGCTGCTCCGCCCGCTCAGGGCCAGGGCCCCAATTCTTCTCGGTAAGCCG 

AGEAH NIRERAKAGGHHNTC 

PAECRTTSGSAPR PGATTTPA 

RRSAQHQGARQGRGPPQHLH 
* * . * * * 

24i 2S1 261 271 281 291 

GCCGGCGAAGCGCACAACATCAGGGAGCGCGCCAAGGCCGGGGGCCACCACAACACCTGC 

IQTDRAEIYLQLQATAGDQD 
XKP IGRKSTPNPKPQPAIRT 

TNRSGGNLP STSSH SRRSGQ 

* * * * * * 

301 311 321 331 341 351 

ATACAAACCGATCGGGCG6AAATCTACCTTCAACTTCAAGCCACAGCCGGCGATCAGGAC 

SSSWTLTILTAKHYRIITFK 
A A P G L Z R.Y SLQSTIESSPLK 
QLLDSDDTHCKALSNHHLZR 

* 

261 371 381 391 401 4H 

AGCAGCTCCTGGACTCTGACGATACTCACTGCAAAGCACTATCGAATCATCACCTTTAAA 
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GSHLKIAEVITVGNNGHIPD 

AATZKSRKSZQWVITAIFQT 

QPPENRGSHNSGZZRPYSRQ 
* * * * * * 

421 431 441 451 461 471 

GGCAGCCACCTGAAAATCGCGGAAGTCATAACAGTGGGTAATAACGGCCATATTCCAGAC 

SIPZKSAGLTGVFLPKPSQR 

VFHRRVPGSPECPPQN PRRD 

YSIEECRAHRSVSSKTLAET 
****** 

481 491 501 511 521 531 

AGTATTCCATAGAAGAGTGCCGGGCTCACCGGAGTGTTTCTTCCAAAACCCTCGCAGAGA 

LLRGLQDPRRPYRVIQADEP 

SFGACKIH ADLIRZYRRM SH 

PSGPARSTQTLZGDTGGZAT 
* * * * * * 

541 551 561 571 581 591 

CTCCTTCGGGGCCTGCAAGATCCACGCAGACCTTATAAGGTGATACAGGCGGATGAGCCA 

LRHP T LLHNST LZTQRKV I L 
CGIPHSSIIAHSRPREKLFW 

AASHTPPZZHTLDP EKSYSG 
****** 

601 611 621 631 641 651 

CTGCGGCATCCCACACTCCTCCATAATAGCACACTCTAGACCCAGAGAAAAGTTATTCTG 

GGVKLRKVILKHH. GCLCCGH 
VESNSEKSFSNTMDAFAAAT 
WSQTQKSHSQTPWMPLLRPQ 
****** 

661 671 681 691 701 711 

GGTGGAGTCAAACTCAGAAAAGTCATTCTCAAACACCATGGATGCCTTTGCTGCGGCCAC 
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SRR EDGVIKGI TVKHTLR EQ 

AAEKTVSSKASPZNTPZGSR 

PPRRRCHQRHHRKTHPEGAG 
****** 

721 731 741 751 761 771 

AGCCGCCGAGAAGACGGTGTCATCAAAGGCATCACCGTAAAACACACCCTGAGGGAGCAG 

GQNSLLNSAEPRAKEGAEGL 
ARIAPSIARNQGPKRAQKVL 
PEZPS QZRGTKGQ.RGRRRSC 

.* 

781 791 801 811 821 831 

GGCCAGAATAGCCTTCTCAATAGCGCGGAACCAAGGGCCAAAGAGGGCGCAGAAGGTCTT 

APGRDALAHFTMGNGLTCGE 
LQAEMPWPTLPWAMVSP7VN 
SRPRCPGPLYHGQW S H L W Z T 

* 

841 851 861 871 881 891 

GCTCCAGGCCGAGATGCCCTGGCCCACTTTACCATGGGCAATGGTCTCACCTGTGGTGAA 

LVTIPLEEGDPGHVTVAKIK 
LLQSPWK KVILDTSRL QRSS 
CYNLSGRRZSWTRHGCKDQA 

* 

901 911 921 931 941 951 

CTTGTTACAATCTTTCTGGAAGAAGGTGATCCTGGACACGTCACGGTTGCAAAGATCAAG 

LKDGGAILALLDHGLHZLVQ 

SRTAEPSWPPSTMASTSSYN 

QGRRSHPGPSRPWPPLARTI 
* * 

961 971 981 991 1001 1Q11 

CTCAAGGACGGCGGAGCCATCCTGGCCCTTCTCGACCATGGCCTCCACTAGCTCGTACAA 
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FTSCNLYGANGRDKTGERVA 
SQVVTCTGPMAG IKRARESR 

HKLZPVRGQWPGZNGRESRE 
****** 

1021 1031 1041 1051 1061 1071 

TTCACAAGTTGTAACCTGTACGGGGCCAATGGCCGGGATAAAACGGGCGAGAGAGTCGCG 

NIRVGSIVELCATPZRPTSV 
TSEWEALZSPVRRRSGPRVW 

HQSGKHCRALCDAVAAHECG 

* * * * * * 

1081 1091 1101 1111 1121 1131 

AACATCAGAGTGGGAAGCATTGTAGAGCTTTGTGCGACGCCGTAGCGGCCCACGAGTGTG 

DSTALRWLGAAHRQCTMSVN 
TARPCAGS6RPC6-SAQCLL I 
QHGLALARGGHAAVHNVCZF 

* * * * * * 

1141 1151 1161 1171 1181 1191 

GACAGCACGGCCTTGCGCTGGCTCGGGGCGGCCATGCGGCAGTGCACAATGTCTGTTAAT 

SNVTTLSQVVSSWGRZRRPC 

QMLRHYHRWZAPGAGREGPV 

KCYDTITGGELLGQ VEKALF 
****** 

1201 1211 1221 1231 1241 1251 

TCAAATGTTACGACACTATCACAGGTGGTGAGCTCCTGGGGCAGGTAGAGAAGGCCCTGT 

SS SGQGGRTAATGTGLARLV 

RARGRVVEQLQQGQVLPDZC 

ELGAGWZNSCNRDRSCQISA 
****** 

1261 1271 1281 1291 1301 1311 

TCGAGCTCGGGGCAGGGTGGTAGAACAGCTGCAACAGGGACAGGTCTTGCCAGATTAGTG 
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PSISWRNS 
L P S V G G I 
F H Q L A E F 
* * * 

1321 1331 1341 

CCTTCCATCAGTTGGCGGAATTCG 



Identity of this sequence with sequences in 
etiologic agents has been confirmed by locating a 
corresponding sequence in a viral strain isolated in 
Burma. The Burmese isolate contains the following 
sequence of nucleotides. 

Non-A Non-B ET: Burmese strain 

1 CGGTTGTTCA GTACCAGTTT ACTGCAGGTG TGCCTGGATC CGGCAAGTCC 
51 CGCTCTATCA CCCAAGCCGA TGTGGACGTT GTCGTGGTCC CGACGCGTGA 
101 GTTGCGTAAT GCCTGGCGCC GTCGCGGCTT TGCTGCTTTT ACCCCGCATA 
151 CTGCCGCCAG AGTCACCCAG GGGCGCCGGG TTGTCATTGA TGAGGCTCCA 
201 TCCCTCCCCC CTCACCTGCT GCTGCTCCAC ATGCAGCGGG CCGCCACCGT 
251 CCACCTTCTT GGCGCCCCGA ACCAGATCCC AGCCATCGAC TTTGAGCACG 
301 CTGGGCTCGT CCCCGCCATC AGGCCCGACT TAGCCCCACC TCCTGGTGGC 
351 ATGTTACCCA TCGCTGCCCT GCGGATGTAT GCGAGCTCAT CCGTGGTGCA 
401 TACCCCATGA TCCAGACCAC TAGCCGGGTT CTCCGTTCGT TGTTCTGGGG 
451 TGAGCCTGCC GTCGGGCAGA AACTAGTGTT .CACCCAGGCG GCCAAGGCCG 
501 CCAACCCCGG CTCAGTGACG GTCCACGAGG CGCAGGGCGC TACCTACACG 
551 GAGACCACTA TTATTGCCAC AGCAGATGCC CGGGGCCTTA TTCAGTCGTC 
601 TCGGGCTCAT GCCATTGTTG CTCTGACGCG CCACACTGAG AAGTGCGTCA 
651 TCATTGACGC ACCAGGCCTG CTTCGCGAGG TGGGCATCTC CGATGCAATC 
701 GTTAATAACT TTTTCCTCGC TGGTGGCGAA ATTGGTCACC AGCGCCCATC 
751 AGTTATTCCC CGTGGCAACC CTGACGCCAA TGTTGACACC CTGGCTGCCT 
801 TCCCGCCGTC TTGCCAGATT AGTGCCTTCC ATCAGTTGGC TGAGGAGCTT 
851 GGCCACAGAC CTGTCCCTGT TGCAGCTGTT CTACCACCCT GCCCCGAGCT 
901 CGAACAGGGC CTTCTCTACC TGCCCCAGGA GCTCACCACC TGTGATAGTG 
951 TCGTAACATT TGAATTAACA GACATTGTGC ACTGCCGCAT GGCCGCCCCG 
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1001 AGCCAGCGCA AGGCCGTGCT GTCCACACTC GTGGGCCGCT ACGGCGTCGC 
1051 ACAAAGCTCT ACAATGCTTC CCACTCTGAT GTTCGCGACT CTCTCGCCCG 
1101 TTXTATCCCG GCCATTGGCC CCGTACAGGT TACAACTTGT GAATTGTACG 
1151 AGCTAGTGGA GGCCATGGTC GAGAAGGGCC AGGATGGCTC CGCCGTCCTT 
1201 GAGCTTGATC TTTGCAACCG TGACGTGTCC AGGATCACCT TCTTCCAGAA 
1251 AGATTGTAAC AAGTTCACCA CAGGTGAGAC CATTGCCCAT GGTAAAGTGG 
1301 GCCAGGGCAT CTCGGCCTGG AGCAAGACCT TCTGCGCCCT CTTTGGCCCT 
1351 TGGTTCCGCG CTATTGAGAA GGCTATTCTG GCCCTGCTCC CTCAGGGTGT 
1401 GTTTTACGGT GATGCCTTTG ATGACACCGT CTTCTCGGCG GCTGTGGCCG 
1451 CAGCAAAGGC ATCCATGGTG TTTGAGAATG ACTTTTCTGA GTTTGACTCC 
1501 ACCCAGAATA ACTTTTCTCT GGGTCTAGAG TGTGCTATTA TGGAGGAGTG 
1551 TGGGATGCCG CAGTGGCTCA TCCGCCTGTA TCACCTTATA AGGTCTGCGT 
1601 GGATCTTGCA GGCCCCGAAG GAGTCTCTGC GAGGGTTTTG GAAGAAACAC 
1651 TCCGGTGAGC CCGGCACTCT TCTATGGAAT ACTGTCTGGA ATATGGCCGT 
1701 TATTACCCAC TGTTATGACT TCCGCGATTT TCAGGTGGCT GCCTTTAAAG 
1751 GTGATGATTC GATAGTGCTT TGCAGTGAGT ATCGTCAGAG TCCAGGAGCT 
1801 GCTGTCCTGA TCGCCGGCTG TGGCTTGAAG TTGAAGGTAG ATTTCCGCCC 
1851 GATCGGTTTG TATGCAGGTG TTGTGGTGGC CCCCGGCCTT GGCGCGCTCC 
1901 CTGATGTTGT GCGCTTCGCC GGCCGGCTTA CCGAGAAGAA TTGGGGCCCT 
1951 GGCCCTGAGC GGGCGGAGCA GCTCCGCCTC GCTGTTAGTG ATTTCCTCCG 
2001 CAAGCTCACG AATGTAGCTC AGATGTGTGT GGATGTTGTT TCCCGTGTTT 
2051 ATGGGGTTTC CCCTGGACTC GTTCATAACC TGATTGGCAT GCTACAGGCT 
2101 GTTGCTGATG GCAAGGCACA TTTCACTGAG TCAGTAAAAC CAGTGCTCGA 
2151 CTTGACAAAT TCAATCTTGT GTCGGGTGGA ATGAATAACA TGTCTTTTGC 
2201 TGCGCCCATG GGTTCGCGAC CATGCGCCCT CGGCCTATTT TGTTGCTGCT 
2251 CCTCATGTTT TTGCCTATGC TGCCCGCGCC ACCGCCCGGT CAGCCGTCTG 
2301 GCCGCCGTCG TGGGCGGCGC AGCGGCGGTT CCGGCGGTGG TTTCTGGGGT 
2351 GACCGGGTTG ATTCTCAGCC CTTCGCAATC CCCTATATTC ATCCAACCAA 
2401 CCCCTTCGCC CCCGATGTCA CCGCTGCGGC CGGGGCTGGA CCTCGTGTTC 
2451 GCCAACCCGC CCGACCACTC GCGTCCGCTT GGCGTGACCA GGCCCAGCGC 
2501 CCCGCCGTTG CCTCACGTCG TAGACCTACC ACAGCTGGGG CCGCGCCGCT 
2551 AACCGCGGTC GCTCCGGCCC CG 

The ability of the methods described herein to 
isolate and identify genetic material from other NANB 
hepatitis, strains has been confirmed by identifying 
genetic material from an isolate obtained in Mexico. 
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The sequence of this isolate was about 75% identical to 
the ET1.1 sequence set forth above. The sequence was 
identified by hybridization using the conditions set 
forth in Section II. B below. A partial cDNA sequence 
consisting of 1661 nucleotides is set forth below. 



5 



Non-A Non-B ET: Mexican Strain 

1 GCGGCCGCTC GGTGGGGTTC CACCTCTCTT CCCATTCCGA ACAGAGAAGT 
10 51 TAAGCAAGGA ATTAATTCGC GGCCGCTCGT GTTGCGTGAG GTGGGTATCT 

101 CAGATGCCAT TGTTAATAAT TTCTTCCTTT CGGGTGGCGA GGTTGGTCAC 
151 CAGAGACCAT CGGTGATTCC GCGAGCAACC CTGACCGCAA TGTTGACGTG 
201 CTTGCGGCGT TTCCACCTTC ATGCCAAATA AGCGCCTTCC ATCAGCTTGC 
250 TGAGGAGCTG GGCCACCGGC GGCGCCGGTG CTGTGCTACC TCCCTGCCCT 
15 301 GAGCTTGAGC AGGGCCTTCT CTATCTGCCA CAGGAGCTAG CCTCCTGTGA 

350 CAGTGTTGTG ACATTTGAGC TAACTGACAT TGTGCACTGC CGCATGGCGG 
401 CCCCTAGCCA AAGGAAAGCT GTTTTGTCCA CGCTGGTAGG CCGGTATGGC 
451 AGACGCACAA GGCTTTATGA TGCGGGTCAC ACCGATGTCC GCGCCTCCCT 
501 TGCGCGCTTT ATTCCCACTC TCGGGCGGGT TACTGCCACC ACCTGTGAAC 
20 551 TCTTTGAGCT TGTAGAGGCG ATGGTGGAGA AGGGCCAAGA CGGTTCAGCC 

601 GTCCTCGAGT TGGATTTGTG CAGCCGAGAT GTCTCCCGCA TAACCTTTTT 
651 CCAGAAGGAT TGTAACAAGT TCACGACCGG CGAGACAATT GCGCATGGCA 
701 AAGTCGGTCA GGGTATCTCC GCCTGGAGTA AGACCTTTTG TGCCCTGTTT 
751 GGCCCCTGGT TCCGTTGCGA TTGAGAAGGG TATTCTATCC CTTTTACCAC 
25 801 AAGCTGTGTT CTACGGGGAT GCTTATGACG ACTCAGTATT CTCTGCTGCC 

851 GTGGCTGGCG CCAGCCATGC CATGGTGTTT GAAAATGATT TTTCTGAGTT 
901 TGACTCGACT CAGAATAACT TTTCCCTAGG TCTTGAGTGC GCCATTATGG 
951 AAGAGTGTGG TATGCCCCAG TGGCTTGTCA GGTTGTACCA TGCCGTCCGG 
1001 TCGGCGTGGA TCCTGCAGGC CCCAAAAGAG TCTTTGAGAG GGTTCTGGAA 
30 1051 GAAGCATTCT GGTGAGCCGG GCACGTTGCT CTGGAATACG GTGTGGAACA 

1101 TGGCAATCAT TGCCCATTGC TATGAGTTCC GGGACCTCCA GGTTGCCGCC 
1151 TTCAAGGGCG ACGACTCGGT CGTCCTCTGT AGTGAATACC GCCAGAGCCC 
1201 AGGCGCCGGT TCGCTTATAG CAGGCTGTGG TTTGAAGCCG AAGGCTGACT 
1251 TCCGGCCGAT TGGGCTGTAT GCCGGGGTTG TCGTCGCCCC GGGGCTCGGG 
35 1301 GCCCTACCCG ATGTCGTTCG ATTCGCCGGA CGGCTTTCGG AGAAGAACTG 

1351 GGGGCCTGAT CCGGAGCGGG CAGAGCAGCT CCGCCTCGCC GTGCAGGATT 
1401 TCCTCCGTAG GTTAACGAAT GTGGCCCAGA TTTGTGTTGA GGTGGTGTCT 
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1451 AGAGTTTACG GGGTTTCCCC GGGTCTGGTT CATAACCTGA TAGGCATGCT 
1501 CCAGACTATT GGTGATGGTA AGGCGCATTT TACAGAGTCT GTTAAGCCTA 
1551 TACTTGACCT TACACACTCA ATTATGCACC GGTCTGAATG AATAACATGT 
1601 GGTTTCCTGC GCCCATGGGT TCGCCACCAT GCGCCCTAGG CCTCTTTTGC 
5 1651 CGAGCGGCCG C 

When comparing the Burmese and Mexican 
strains, 76.7% identity is seen in a 1372 nucleotide 
overlap beginning at nucleotide 13 of the Mexican 

10 strain and nucleotide 331 of the Burmese strain. 

It will be recognized by one skilled in the 
art of molecular genetics that each of the last two 
sequences teaches a corresponding complementary DNA 
sequence as well as RNA sequences corresponding to both 

15 the principal sequence shown and the complementary DNA 
sequence. Additionally, open reading frames encoding 
peptides are present, and expressible peptides are 
disclosed by the nucleotide sequences without setting 
forth the amino acid sequences explicitly, in the same 

20 manner as if the amino acid sequences were explicitly 
set forth as in the ET1.1 sequence above. 



DETAILED DESCRIPTION OF THE INVENTION 

25 

I. Definitions 

The terms defined below have the following 
meaning herein: 

30 

1. "Enterically transmitted non-A/non-B (ET-NANB) 
hepatitis viral agent" means a virus, virus type, or 
virus class which (1) causes water-borne, infectious 
hepatitis, (ii) is transmissible in cynomolgus monkeys, 
35 (iii) is serologically distinct from hepatitis A virus 
(HAV) and hepatitis B virus. (HAB), and (iv) includes a 
genomic region which is homologous to the 1.33 kb cDNA 



.5 
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15 



25 



30 



35 
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insert in plasmid pTZ-KFl(ETl.l) carried in s. C oli 
strain BB4 identified by ATCC deposit number 67717. 

2. Two nucleic acid fragments are "homologous" if 
they are capable of hybridizing to one another under 
hybridization conditions in which hybridized strands 
contain at most about 25-30% basepair mismatches. In 
general, two single-strand nucleic acid species will be 
homologous if they hybridize under the conditions 
described in Maniatis et al. , op. cit. , pp. 320-323, 
but using the following wash conditions: 2 x SCC, 0.1% 
SDS, room temperature - twice, 30 minutes each; then 2 
x SCC, 0.1% SDS, 50 °C - once, 30 minutes; then 2 x SCC, 
room temperature - twice, 10 minutes each. More 
preferably, homologous nucleic acid strands contain 15- 
25% basepair mismatches, even more preferably 5-15% 
basepair mismatches. These degrees of homology can be 
selected by using more stringent wash conditions for 
identification of clones from gene libraries (or other 
sources of genetic material),, as is well known in the 
20 art. 

3.. A DNA fragment is "derived from" an ET-NANB 
viral agent if it has the same or substantially the 
same basepair sequence as a region of the viral agent 
genome . 

4. A protein is "derived from" an ET-NANB viral 
agent if it is encoded by an open reading frame of a 
DNA or RNA fragment derived from an ET-NANB viral 
agent. 

11 • Obtaining Cloned ET-NANB Frag ments 

According to one aspect of the invention, it 
has been found that a virus-specific DNA clone can be 
produced by (a) isolating RNA from the bile of a 
cynomolgus monkey having a known ET-NANB infection, (b) 
cloning the cDNA fragments to form a fragment library, 
and (c) screening the library by differential 
hybridization to radiolabeled cDNAs from infected and 
non-infected bile sources. 
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A. cDNA Fragment Mixture 

ET-NANB infection in cynomolgus monkeys is 
initiated by inoculating the animals intravenously with 
5 a 10% w/v suspension from human case stools positive 
for 27-34 nm ET-NANB particles (mean diameter 32 nm) . 
An infected animal is monitored for elevated levels of 
alanine aminotransferase, indicating hepatitis 
infection. ET-NANB infection is confirmed by 
10 imraunospecif ic binding of seropositive antibodies to 
virus-like particles (VLPs), according to published 
methods (Gravelle). Briefly, a stool (or bile) 
specimen taken from the infected animal 3-4 weeks after 
infection is diluted 1:10 with phosphate-buffered 
15 saline, and the 10% . suspension is clarified by low- 
speed centrifugation and filtration successively 
through 1.2 and 0.45 micron filters. The material may 
be further purified by pelleting through a 30% sucrose 
cushion (Bradley). The resulting preparation of VLPs 
20 is mixed with diluted serum from human patients with 
known ET-NANB infection. After incubation overnight, 
the mixture is centrifuged overnight to pellet immune 
aggregates, and these are stained and examined by 
electron microscopy for antibody binding to the VLPs. 
25 ET-NANB infection can also be confirmed by 

seroconversion to VLP-positive serum. Here the serum 
of the infected animal is mixed as above with 27-34 nm 
VLPs isolated form the stool specimens of infected 
human cases and examined by immune electron microscopy 
30 for antibody binding to the VLPs. 

Bile can be collected from ET-NANB positive 
animals by either cannulating the bile duct and 
collecting the bile fluid or by draining the bile duct 
during necropsy. Total RNA is extracted from the bile 
35 by hot phenol extraction, as outlined in Example 1A. 
The RNA fragments are used to synthesize corresponding 
duplex cDNA fragments by random priming, also as 
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referenced in Example 1A. The cDNA fragments, may be 
fractionated by gel electrophoresis or density gradient 
centrifugation to obtain a desired size class of 
fragments, e.g., 500-4,000 basepair fragments. 
5 Although alternative sources of viral material, 

such as VLPs obtained from stool samples (as described' 
in Example 4), may be used for producing a cDNA 
fraction, the bile source is preferred. According to 
one aspect of the invention, it has been found that 

10 bile from ET-NANB-inf ected monkeys shows a greater 

number of intact viral particles than material obtained 
from stool samples, as evidenced by immune electron 
microscopy. Bile obtained from an ET-NANB infected 
human or cynomolgus monkey/ for use as a source of ET- 

15 NANB viral protein or genomic material, or intact 
virus, forms part of the present invention. 

B. cDNA Library and. Screening 

The cDNA fragments from above are cloned into a 
suitable cloning vector to form a cDNA library. This 
nay be done by equipping blunt-ended fragments with a 
suitable end linker, such as an EcoRI sequence, and 
inserting the fragments into a suitable insertion site 
of a cloning vector, such as at a unique EcoRI site. 
After initial cloning, the library may be recloned, if 
desired, to increase the percentage of vectors 
containing a fragment insert. The library construction 
described in Example IB is illustrative. Here cDNA 
fragments were blunt-ended, equipped with EcoRI ends, 
and inserted into the EcoRI site of the lambda phage 
vector gtlO. The library phage, which showed less than 
5% fragment inserts, was isolated, and the fragment 
inserts recloned into the lambda gtlO vector, yielding 
more than 95% insert-containing phage. 

The cDNA library is screened for sequences 
specific for ET-NANB by differential hybridization to 
cDNA probes derived from infected and non-infected 
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sources, cDNA fragments from infected and non-infected 
source bile or stool viral isolates can be prepared as 
above, Radiolabeling the fragments is by random 
labeling, nick translation, or end labeling, according 
5 to conventional methods (Maniatis, p. 109). The cDNA 
library from above is screened by transfer to duplicate 
nitrocellulose filters, and hybridization with both 
infected-source and non-infected-source (control) 
radiolabeled probes, as detailed in Example 2. In 

10 order to recover sequences that hybridize at the 

preferred outer limit of 25-30% basepair mismatches, 
clones can be selected if they hybridize under the 
conditions described in Maniatis et al. , op, cit. , pp. 
320-323, but using the following wash conditions: 2 x 

15 SCC, 0.1% SDS, room temperature - twice, 30 minutes 

each; then 2 x SCC, 0.1% SDS, 50°C - once, 30 minutes; 
then 2 x SCC, room temperature - twice, 10 minutes 
each. These conditions allowed identification of the 
Mexican isolate discussed above using the ET1.1 

20 sequence as a probe. Plaques which show selective 
hybridization to the infected-source probes are 
preferably replated at low plating density and 
rescreened as above, to isolate single clones which are 
. specific for ET-NANB sequences. As indicated in 

25 Example 2, sixteen clones which hybridized specifically 
with infected-source probes were indentified by these 
procedures. One of the clones, designated lambda gtlO- 
1.1, contained a 1.33 kilobase fragment insert. 

30 C. ET-NANB Sequences 

The basepair sequence of cloned regions of the 
ET-NANB fragments from Part B are determined by 
standard sequencing methods. In one illustrative 
method, described in Example 3, the fragment insert 

35 from the selected cloning vector is excised, isolated 
by gel electrophoresis, and inserted into a cloning 
vector whose basepair sequence on either side of the 
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insertion site is known. The particular vector 
employed in Example 3 is a pTZ-KFl vector shown at the 
left in Figure 1. The ET-NANB fragment from the gtlO- 
1.1 phage was inserted at the unique EcoRl site of the 
5 pTZ-KFl plasmid. Recombinants carrying the desired 
insert were identified by hybridization with the 
isolated 1.33 kilobase fragment, as described in 
Example 3. One selected plasmid, identified as pTZ-KFl 
(ET1.1), gave the expected 1.33 kb fragment after 
10 vector digestion with EcoRI. E. coli strain BB4 
infected with the pTZ-KFl(ETl.l) plasmid has been 
deposited with the American Type Culture Collection, 
Rockville, MD, and is identified by ATCC deposit nubmer 
67717. 

15 The pTZ-KPl(ETl.l) plasmid is illustrated at 

the bottom in Figure 1. The fragment insert has 5' and 
3' end regions denoted at A and C, respectively, and an 
intermediate region, denoted at B. The sequences in 
these regions were determined by standard dideoxy 

20 sequencing and are set forth above. The three short 
sequences (A, B, and C) are from the same insert 
strand. As will be seen in Example 3, the B-region 
sequence was actually determined from the opposite 
strand, so that the B-region sequence shown above 

25 represent the complement of the sequence in the 

sequenced strand. The base numbers of the partial 
sequences are approximate. 

Later work in the laboratory of the inventors 
identified the full sequence, also set forth above. 

30 Fragments of this total sequence can readily be 

prepared using restriction endonucleases. Computer 
analysis of both the forward and reverse sequence has 
identified a number of cleavage sites. The specific 
cleavage sites are summarized (for the forward 

35 direction) in the following tables. 
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114, 
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TCGCGA 


264 


X^UliUsA ( Audi ) 


TCTAGA 
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TTT*AAA(DraI) 


TTTAAA 
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G*TGCAC(ApaLI) 


GTGCAC 
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ACCTGC 


99 
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10 
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216, 
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III. ET-NANB Fragments 
15 According to another aspect, the invention 

includes ET-NANB-specific fragments or probes which 
hybridize with ET-NANB genomic sequences or cDNA 
fragments derived therefrom. The fragments may include 
full-length cDNA fragments such as described in Section 
II, or may be derived from shorter sequence regions 
within cloned cDNA fragments. Shorter fragments can be 
prepared by enzymatic digestion of full-length 
fragments under conditions which yield desired-sized 
fragments , as will be described in Section IV. 
25 Alternatively, the fragments can be produced by 

oligonucleotide synthetic methods, using sequences 
derived from the cDNA fragments. Methods or services 
for producing selected-sequence oligonucleotide 
fragments are available. 
30 To confirm that a given ET-NANB fragment is in 

fact derived from the ET-NANB viral agent, the fragment 
can be shown to hybridize selectively with cDNA from 
infected sources. By way of illustration, to confirm 
that the 1.33 kb fragment in the pTZ-KPl(El.l) plasmid 
35 is ET-NANB in origin, the fragment was excised from the 
pTZ-KFl(ETl.l) plasmid, purified, and radiolabeled by 
random labeling. The radiolabeled fragment was 
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hybridized with fractionated cDNAs from infected and 
non-infected sources to confirm that the probe reacts 
only with infected-source cDNAs. This method is 
illustrated in Example 4, where the above radiolabeled 
5 1.33 kb fragment from pTZ-KFl(ETl.l) plasmid was 

examined for binding to cDNAs prepared from infected 
and non-infected sources. The infected sources are (1) 
bile from a cynomolgus monkey infected with a strain of 
virus derived from stool samples from human patients 
10 from Burma with known ET-NANB infections and (2) a * 
viral agent derived from the stool sample of a human 
ET-NANB patient from Mexico. The cDNAs in each 
fragment mixture were first amplified by a 
linker/primer amplification method described in Example 
15 4. Fragment separation was on agarose gel, followed by 
Southern blotting and then hybridization to bind the 
radiolabeled 1.33 kb fragment to the fractionated. 
cDNAs. The lane containing cDNAs from the infected 
sources showed a smeared band of bound probe, as 
20 expected (cDNAs amplified by the linker/primer 

amplification method would be expected to have a broad 
range of si2es). No probe binding to the amplified 
cDNAs from the non-infected sources was observed,. The 
results indicate that the 1.33 kb probe is specific for 
25 cDNA fragments associated with ET-NANB infection. This 
same type of study, using ET 1.1 as the probe, has 
demonstrated hybridization to ET-NANB samples collected 
from Tashkent, Somalia, Borneo and Pakistan. Secondly, 
the fact that the probe is specific for ET-NANB related 
30 sequences derived from different continents (Asia, 

Africa and North America) indicates the cloned ET-NANB 
Africa sequence is derived from a common ET-NANB virus 
or virus class responsible for ET-NANB hepatitis 
infection worldwide. 
35 In a related confirmatory study, probe binding 

to fractionated genomic fragments prepared from human 
or cynomolgus monkey genomic DNA was examined. No 
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probe binding was observed to either genomic fraction, 
demonstrating that the ET-NANB fragment is not an 
endogenous human or cynoraologus fragment. 

Another confirmation of ET-NANB specific 
5 sequences in the fragments is the ability to express 
ET-NANB proteins from coding regions in the 
fragments. Section IV below discusses methods of 
protein expression using the fragments. 

One important use of the ET-NANB-specific 
10 fragments is for identifying ET-NANB-de rived cDNAs 
which contain additional sequence information. The 
newly identified cDNAs, in turn, yield new fragment 
probes, allowing further iterations until the entire 
viral genome is identified and sequenced. Procedures 
for identifying additional ET-NANB library clones and 
generating new probes therefrom generally follow the 
cloning and selection procedures described in Section 
II. 
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The fragments (and oligonucleotides prepared 
based on the sequences given above) are also useful as 
primers for a polymerase chain reaction method of 
detecting ET-NANB viral genomic material in a patient 
sample. This diagnostic method will be described in 
Section V below. 



IV. ET-NANB Proteins 

As indicated above, ET-NANB proteins can be 
prepared by expressing open reading-frame coding 
regions in ET-NANB fragments, in one. preferred 
approach, the ET-NANB fragments used for protein 
expression are derived from cloned cDNAs which have 
been treated to produce desired-size fragments, and 
preferably random fragments with sizes predominantly 
between about 100 to about 300 base pairs. Example 5 
describes the preparation of such fragments by DNAs 
digestion. Because it is desired to obtain peptide 
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antigens of between about 30 to about 100 amino acids, 
the digest fragments are preferably size fractionated, 
for example by gel electrophoresis, to select those in 
the approximately 100-300 basepair size range. 

A. Expression Vector 

The ET-NANB fragments are inserted into a 
suitable expression vector. One exemplary expression 
vector is lambda gtll, which contains a unique EcoRI 
insertion site 53 base pairs upstream of the 
translation termination codon of the beta-galactosidase 
gene. Thus, the inserted sequence will be expressed as 
a beta-galactosidase fusion protein which contains the 
N- terminal portion of the beta-galactosidase gene, the 
heterologous peptide, and optionally the C-terminal 
region of the beta-galactosidase peptide (the C- 
terminal portion being expressed when the heterologous 
peptide coding sequence does not contain a translation 
termination codon). This vector also produces a 
temperature-sensitive repressor (cl857) which causes 
viral lysogeny at permissive temperatures, e.g., 32°C, 
and leads to viral lysis at elevated temperatures, 
e.g., 42°C. Advantages of this vector include: (1) 
highly efficient recombinant generation, (2) ability to 
select lysogenized host cells on the basis of host-cell 
growth at permissive, but not non-permissive, 
temperatures, and (3) high levels of recombinant fusion 
protein production. Further, since phage containing a 
heterologous insert produces an inactive beta- 
galactosidase enzyme, phage with inserts can be readily 
identified by a beta-galactosidase colored-substrate 
reaction. 

For insertion into the expression vector, the 
viral digest fragments may be modified, if needed, to 
contain selected restriction-site linkers, such as 
EcoRI linkers, according ^ conventional procedures. 
Example 1 illustrates methods for cloning the digest 
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fragments into lambda gtll, which includes the steps of 
blunt-ending the fragments, ligating with EcoRI 
linkers, and introducing the fragments into EcoRl-cut 
lambda gtll. The resulting viral genomic library may 
5 be checked to confirm that a relatively large 

(representative) library has been produced. This can 
be done, in the case of the lambda gtll vector, by 
infecting a suitable bacterial host, plating the 
bacteria, and examining the plaques for loss of beta- 
10 galactosidase activity. Using the procedures described 
in Example 1, about 50% of the plaques showed loss of 
enzyme activity. 

B » Peptide Antigen Expression 
15 The viral genomic library formed above is 

screened for production of peptide antigen (expressed 
as a fusion protein) which is immunoreactive with 
antiserum from ET-NANB seropositive individuals, in a 
preferred screening method, host cells infected with 
phage library vectors are plated, as above, and the 
plate is blotted with a nitrocellulose filter to 
transfer recombinant protein antigens produced by the 
cells onto the filter. The filter is then reacted with 
the ET-NANB antiserum, washed to remove unbound 
25 antibody, and reacted with reporter-labeled, anti-human 
antibody, which becomes bound to the filter, in 
sandwich fashion, through the anti-ET-NANB antibody. 

Typically phage plaques which are identified 
by virtue of their production of recombinant antigen of 
interest are re-examined at a relatively low density 
for production of antibody-reactive fusion protein. 
Several recombinant phage clones which produced 
immunoreactive recombinant antigen were identified in 
the procedure. 

The selected expression vectors may be used for 
scale-up production, for purposes of recombinant 
protein purification. Scale-up production is carried 
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out using one of a variety of reported methods for (a) 
lysogenizing a suitable host, such as E. coli , with a 
selected lambda gtll recombinant (b) culturing the 
transduced cells under conditions that yield high 
5 levels of the heterologous peptide, and (c) purifying 
the recombinant antigen from the lysed cells. 

In one preferred method involving the above 
lambda gtll cloning vector, a high-producer E. coli 
host, BNN103, is infected with the selected library 
10 phage and replica plated on two plates. One of the 
plates is grown at 32°C, at which viral lysogeny can 
occur, and the other at 42°C, at which the infecting 
phage is in a lytic stage and therefore prevents cell 
growth. Cells which grow at the lower but not the 
15 higher temperature are therefore assumed to be 
successfully lysogenized. 

The lysogenized host cells are then grown under 
liquid culture conditions which favor high production 
of the fused protein containing the viral insert, and 
20 lysed by rapid freezing to release the desired fusion 
protein. 

C. Peptide Purification 

The recombinant peptide can be purified by 

25 standard protein purification procedures which may 
include differential precipitation, molecular sieve 
chromatography, ion-exchange chromatography, 
isoelectric focusing, gel electrophoresis and affinity 
chromatography. In the case of a fused protein, such 

30 as the beta-galactosidase fused protein prepared as 

above, the protein isolation techniques which are used 
can be adapted from those used in isolation of the 
native protein. Thus, for isolation of a beta- 
glactosidase fusion protein, the protein can be 

35 isolated readily by simple affinity chromatography, by 
passing the cell lysis material over a solid support 
having surface-bound anti-beta-galactosidase antibody. 
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D. Viral Proteins 

The ET-NANB protein of the invention may also 
be derived directly from the ET-NANB viral agent. VLPs 
5 isolated from a stool sample from an infected 

individual, as above, are one suitable source of viral 
protein material. The VLPs isolated from the stool 
sample may be further purified by affinity 
chromatography prior to protein isolation (see 

10 below) . The viral agent may also be raised in cell 
culture, which provides a convenient and potentially 
concentrated source of viral protein. Co-owned U.S. 
Patent Application Serial No. 846,757, filed April 1, 
1986, describes an immortalized trioma liver cell which 

15 supports NANB infection in cell culture. The trioma 
cell line is prepared by fusing human liver cells with 
a mouse/human fusion partner selected for human 
chromosome stability. Cells containing the desired 
NANB viral agent can be identified by immuno- 

20 fluorescence methods, employing anti-ET-NANB human 
antibodies. 

The viral agent is disrupted, prior to protein 
isolation, by conventional methods, which can include 
sonicatioh, high- or low-salt conditions, or use of 
25 detergents. 

Purification of ET-NANB viral protein can be 
carried out by affinity chromatography, using a 
purified anti-ET-NANB antibody attached according to 
standard methods to a suitable solid support. The 

30 antibody itself may be purified by affinity 

chromatography, where an immunoreactive recombinant 
ETNANB protein, such as described above, is attached to 
a solid support, for isolation of anti-ET-NANB 
antibodies from an immune serum source. The bound 

35 antibody is released from the support by standard 
methods. 
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Alternatively, the anti-ET-NANB antibody may be 
a monoclonal antibody (Mab) prepared by immunizing a 
mouse or other animal with recombinant ET-NANB protein, 
isolating lymphocytes from the animal and immortalizing 
5 the cells with a suitable fusion partner, and selecting 
successful fusion products which react with the 
recombinant protein immunogen. These in turn may be 
used in affinity purification procedures, described 
above, to obtain native ET-NANB antigen. 

10 

V. Utility 



A. Diagnostic Methods 

The particles and antigens of the invention,, as 

15 well as the genetic material, can be used in diagnostic 
assays. Methods for detecting the presence of ET-NANB 
hepatitis comprise analyzing a biological sample such 
as a blood sample, stool sample or liver biopsy 
specimen for the presence of an analyte associated with 

20 ET-NANB hepatitis virus. 

The analyte can be a nucleotide sequence which 
hybridizes with a probe comprising a sequence of at 
least about 16 consecutive nucleotides, usually 30 to 
200 nucleotides, up to substantially the full sequence 

25 of the sequences shown above (cDNA sequences). The 

analyte can be RNA or cDNA. The analyte is typically a 
virus particle suspected of being ET-NANB or a particle 
for which this classification is being ruled out. The 
virus particle can be further characterized as having 

30 an RNA viral genome comprising a sequence at least 
about 80% homologous to a sequence of at least 12 
consecutive nucleotides of the "forward" and "reverse" 
sequences given above, usually at least about 90% 
homologous to at least about 60 consecutive nucleotides 

35 within the sequences, and may comprise a sequence 
substantially homologous to the full-length 
sequences. In order to detect an analyte, where the 
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analyte hybridizes to a probe, the probe may contain a 
detectable label. 

• The analyte can also comprise an antibody which 
recognizes an antigen, such as a cell surface antigen, 
5 on a ET-NANB virus particle. The analyte can also be a 
ET-NANB viral antigen. Where the analyte is an 
antibody or ah antigen, either a labelled antigen or 
antibody, respectively, can be used to bind to the 
analyte to form an immunological complex, which can 
then be detected by means of the label. 

Typically, methods for detecting analytes such 
as surface antigens and/or whole particles are based on 
immunoassays. Immunoassays can be conducted either to 
determine the presence of antibodies in the host that 
15 have arisen from infection by ET-NANB hepatitis virus 
or by assays that directly determine the presence of 
virus particles or antigens. Such techniques are well 
known and need not be described here in detail. Exam- 
ples include both heterogeneous and homogeneous immuno- 
20 assay techniques. Both techniques are based on the 

formation of an immunological complex between the virus 
particle or its antigen and a corresponding specific 
antibody. Heterogeneous assays for viral antigens 
typically use a specific monoclonal or polyclonal 
25 antibody bound to a solid surface. Sandwich assays are 
becoming increasingly popular. Homogeneous assays, 
which are carried out in solution without the presence 
of a solid phase, can also be used, for example by 
determining the difference in enzyme activity brought 
on by binding of free antibody to an enzyme-antigen 
conjugate. A number of suitable assays are disclosed 
in U.S. Patent Nos. 3,817,837, 4,006,360, 3,996,345. 

When assaying for the presence of : antibodies 
induced by ET-NANB viruses, the viruses and antigens of 
the invention can be used as specific binding agents to 
detect either IgG or igM tfctibodies . Since IgM anti- 
bodies are typically the. first antibodies that appear 
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during the course of an infection, when IgG synthesis 
may not yet have been initiated, specifically distin- 
guishing between IgM and IgG antibodies present in the 
blood stream of a host will enable a physician or other 
investigator to determine whether the infection is 
recent or chronic. 

In one diagnostic configuration, test serum is 
reacted with a solid phase reagent having surface-bound 
ET-NANB protein antigen. After binding anti-ET-NANB 
antibody to the reagent and removing unbound serum 
components by washing, the reagent is reacted with 
reporter-labeled anti-human antibody to bind reporter 
to the reagent in proportion to the amount of bound 
anti-ET-NANB antibody on the solid support. The 
reagent is again washed to remove unbound labeled 
antibody, and the amount of reporter associated with 
the reagent is determined. Typically, the reporter is 
an enzyme which is detected by incubating the solid 
phase in the presence of a suitable fluorometric or 
colorimetric substrate. 

The solid surface reagent in the above assay 
prepared by known techniques for attaching protein 
material to solid support material, such as polymeric 
beads, dip sticks, or filter material. These 
attachment methods generally include non-specific 
adsorption of the protein to the support or covalent 
attachment of the protein, typically through a free 
amine group, to a chemically reactive group on the 
solid support, such as an activate carboxyl, hydroxyl, 
or aldehyde group. 

In a second diagnostic configuration, known as 
a homogeneous assay, antibody binding to a solid 
support produces some change in the reaction medium 
which can be directly detected in the medium. Known 
general types of homogeneous assays proposed heretofore 
include (a) spin-labeled reporters, where antibody 
binding to the antigen is detected by a change in 
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reported mobility (broadening of the spin splitting 
peaks), (b) fluorescent reporters, where binding is 
detected by a change in fluorescence efficiency, (c) 
enzyme reporters, where antibody binding effects 
5 enzyme/substrate interactions, and (d) liposome-bound 
reporters, where binding leads to liposome lysis and 
release of encapsulated reporter. The adaptation of 
these methods to the protein antigen of the present 
invention follows conventional methods for preparing 
homogeneous assay reagents. 

In each of the assays described above, the 
assay method involves reacting the serum from a test 
individual with the protein antigen and examining the 
antigen for the presence of bound antibody. The 
15 examining may involve attaching a labeled anti-human 
antibody to the antibody being examined, either igM 
(acute phase) or IgG (convalescent phase), and 
measuring the amount of reporter bound to the solid 
support, as in the first method, or may involve 
20 observing the effect of antibody binding on a 

homogeneous assay reagent, as in the second method. 

Also forming part of the invention is an assay 
system or kit for carrying out the assay method just 
described. The kit generally includes a support with 
25 surface^bound recombinant protein antigen which is (a) 
immunoreactive with antibodies present in individuals 
infected with enterically transmitted nonA/nonB viral 
agent and (b) derived from a viral hepatitis agent 
whose genome contains a region which is homologous to 
the 1.33 kb DNA EcoRl insert present in plasmid pTZ- 
KPl(ETl.l) carried in E. Coll strain BB4, and having 
ATCC deposit no. 67717. A reporter-labeled anti-human 
antibody in the kit is used for detecting surface-bound 
anti-ET-NANB antibody. 



30 



35 



WO 89/12462 PCT/US89/02648 



37 



B - Viral Genome Diagnostic Applications 

The genetic material of the invention can it- 
self be used in numerous assays as probes for genetic 
material present in naturally occurring infections. 
5 One method for amplification of target nucleic acids, 
for later analysis by hybridization assays, is known as 
the polymerase chain reaction or PCR technique. The 
PCR technique can be applied to detecting virus 
particles of the invention in suspected pathological 
10 samples using oligonucleotide primers spaced apart from 
each other and based on the genetic sequence set forth 
above. The primers are complementary to opposite 
strands of a double stranded DNA molecule and are 
typically separated by from about 50 to 450 nt or more 
15 (usually not more than 2000 nt). This method entails 

preparing the specific oligonucleotide primers and then 
repeated cycles of target DNA denaturation, primer 
binding, and extension with a DNA polymerase to obtain 
DNA fragments of the expected length based on the 
20 primer spacing. Extension products generated from one 
primer serve as additional target sequences for the 
other primer. The degree of amplification of a target 
sequence is controlled by the number of cycles that are 
performed and is theoretically calculated by the simple 
25 formula 2 n where n is the number of cycles. Given that 
the average efficiency per cycle ranges from about 65% 
to 85% , 25 cycles produce from 0.3 to 4.8 million 
copies of the target sequence. The PCR method is de- 
scribed in a number of publications, including Saiki et 
30 al., Science (1985) 230:1350-1354; Saiki et al., Nature 
(1986) 324:163-166; and Scharf et al M Science (1986) 
233:1076-1078. Also see U.S. Patent Nos. 4,683,194; 
4,683,195; and 4,683,202. 

The invention includes a specific diagnostic 
35 method for determination of ET-NANB viral agent, based 
on selective amplification of ET-NANB fragments. This 
method employs a pair of single-strand primers derived 
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from non-homologous regions of opposite strands of a 
DNA duplex fragment, which in turn is derived from an 
enterically transmitted viral hepatitis agent whose 
genome contains a region which is homologous to the 
1.33 kb DNA EcoRI insert present in plasmid pTZ- 
KFl(ETl.l) carried in E. coli strain BB4, and having 
ATCC deposit no. 67717. These "primer fragments," 
which form one aspect of the invention, are prepared 
from ET-NANB fragments such as described in Section in 
above. The method follows the process for amplifying 
selected nucleic acid sequences as disclosed in U.S. 
Patent No. 4,683,202, as discussed above. 
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C. Peptide Vaccine 

Any of the antigens of the invention can be 
used in preparation of a vaccine. A preferred starting 
material for preparation of a vaccine is the particle 
antigen isolated from bile. The antigens are 
preferably initially recovered as intact particles as 
20 described above. However, it is also possible to pre- 
pare a suitable vaccine from particles isolated from 
other sources or non-particle recombinant antigens. 
When non-particle antigens are used (typically soluble 
antigens), proteins derived from the viral envelope or 
viral capsid are preferred for use in preparing vac- 
cines. These proteins can be purified by affinity 
chromatography, also described above. 

If the purified protein is not immunogenic per 
se, it can be bound to a carrier to make the protein 
immunogenic. Carriers include bovine serum albumin, 
keyhole limpet hemocyanin and the like. It is desir- 
able, but not necessary, to purify antigens to be sub- 
stantially free of human protein. However, it is more 
important that the antigens be free of proteins, 
viruses, and other substances not of human origin that 
may have been introduced by way of, or contamination 
of, the nutrient medium, cell lines, tissues, or patho- 
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logical fluids from which the virus is cultured or 
obtained. 

Vaccination can be conducted in conventional 
fashion. For example, the antigen, whether a viral 
5 particle or a protein, can be used in a suitable 
diluent such as water, saline, buffered salines, 
complete or incomplete adjuvants, and the like. The 
iramunogen is administered using standard techniques for 
antibody induction, such as by subcutaneous administra- 

10 tion of physiologically compatible, sterile solutions 
containing inactivated or attenuated virus particles or 
antigens. An immune response producing amount of virus 
particles is typically administered per vaccinizing 
injection, typically in a volume of one milliliter or 

15 less. 

A specific example of a vaccine composition 
includes, in a pharmacologically acceptable adjuvant, a 
recombinant protein or protein mixture derived from an 
enterically transmitted nonA/nonB viral hepatitis agent 

20 whose genome contains a region which, is homologous to 
the 1.33 kb DNA EcoRI insert present in plasmid pTZ- 
KFl(ETl.l) carried in E. coli strain BB4, and having 
ATCC deposit no. 67717. The vaccine is administered at 
periodic intervals until a significant titer of anti- 

25 ET-NANB antibody is detected in the serum. The vaccine 
is intended to protect against ET-NANB infection. 

Prophylactic and Therapeutic 
Antibodies and Antisera 

30 In addition to use as a vaccine, the composi- 

tions can be used to prepare antibodies to ET-NANB 
virus particles. The antibodies can be used directly 
as antiviral agents. To prepare antibodies, a host 
animal is immunized using the virus particles or, as 

35 appropriate, non-particle antigens native to the virus 
particle are bound to a carrier as described above for 
vaccines. The host serum or plasma is collected 
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following an appropriate time interval to provide a 
composition comprising antibodies reactive with the 
virus particle. The gamma globulin fraction or the IgG 
antibodies can be obtained, for example, by use of 
5 saturated ammonium sulfate or DEAE Sephadex, or other 
techniques known to those skilled in the art. The 
antibodies are substantially free of many of the 
adverse side effects which may be associated with other 
anti-viral agents such as drugs. 

The antibody compositions can be made even more 
compatible with the host system by minimizing potential 
adverse immune system responses. This is accomplished 
by removing all or a portion of the Fc portion of a 
foreign species antibody or using an antibody of the 
15 same species as the host animal, for example, the use 
of antibodies from human/human hybridomas. 

The antibodies can also be used as a means of 
enhancing the immune response since antibody-virus 
complexes are recognized by macrophages. The anti- 
20 bodies can be administered in amounts similar to those 
used for other therapeutic administrations of anti- 
body. For example, pooled gamma globulin is admini- 
stered at 0.02-0.1 ml/lb body weight during the early 
incubation of other viral diseases such as rabies, 
25 measles and hepatitis B to interfere with viral entry 
into cells. Thus, antibodies reactive with the ET-NANB 
virus particle can be passively administered alone or 
in conjunction with another anti-viral agent to a host 
infected with an ET-NANB virus to enhance the immune 
response and/or the effectiveness of an antiviral drug. 

Alternatively, anti-ET-NANB-virus antibodies 
can be induced by administering anti-idiotype anti- 
bodies as immunogens. Conveniently, a purified antiET- 
NANB-virus antibody preparation prepared as descibed 
35 above is used to induce anti-idiotype antibody in a 
host animal. The composition is administered to the 
host animal in a suitable diluent. Following 
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administration, usually repeated administration, the 
host produces anti-idiotype antibody. To eliminate an 
immunogenic response to the Pc region, antibodies pro- 
duced by the same species as the host animal can be 
5 used or the Pc region of the administered antibodies 
can be removed. Following induction of anti-idiotype 
antibody in the host animal, serum or plasma is removed 
to provide an antibody composition. The composition 
can be purified as described above for anti-ET-NANB- 
10 virus antibodies, or by affinity chromatography using 
an ti-ET-NANB- virus antibodies bound to the affinity 
matrix. The anti-idiotype antibodies produced are 
similar in conformation to the authentic ET-NANB 
antigen and may be used to prepare an ET-NANB vaccine 
15 rather than using a ET-NANB particle antigen. 

When used as a means of inducing an t i -ET-NANB- 
virus antibodies in a patient, the manner of injecting 
the antibody is the same as for vaccination purposes, 
namely intramuscularly, intraperitoneal^, subcutane- 
20 ously or the like in an effective concentration in a 
physiologically suitable diluent with or without adju- 
vant. One or more booster injections may be desirable. 
The anti-idiotype method of induction of an t i -ET-NANB- 
virus antibodies can alleviate problems which may be 
25 caused by passive administration of anti-ET-NANB-virus 
antibodies, such as an adverse immune response, and 
those associated with administration of purified blood 
components, such as infection with as yet undiscovered 
viruses. 

30 The ET-NANB derived proteins of the invention 

are also intended for use in producing antiserum 
designed for pre- or post-exposure prophylaxis. Here 
an ET-NANB protein, or mixture of proteins is 
formulated with a suitable adjuvant and administered by 

35 injection to human volunteers, according to known 
methods for producing hum4S antisera. Antibody 
response to the injected proteins is monitored, during 
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a several- week period following immunization, by 
periodic serum sampling to detect the presence an anti- 
ET-NANB serum antibodies, as described in Section HA 
above . 

5 The antiserum from immunized individuals may be 

administered as a pre-exposure prophylactic measure for 
individuals who are at risk of contracting infection. 
The anitserum is also useful in treating an individual 
post-exposure, analogous to the use of high titer 
antiserum against hepatitis B virus for post-exposure 
prophylaxis . 
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E. Monoclonal Antibodies 

For both in vivo use of antibodies to ET-NANB 
virus particles and proteins and anti-idiotype anti- 
bodies and diagnostic use, it may be preferable to use 
monoclonal antibodies. Monoclonal anti-virus particle 
antibodies or anti-idiotype antibodies can be produced 
as follows. The spleen or lymphocytes from an 
immunized animal are removed and immortalized or used 
to prepare hybridomas by methods known to those skilled 
in the art. To produce a human-human hybridoma, a 
human lymphocyte donor is selected. A donor known to 
be infected with a ET-NANB virus (where infection has 
25 been shown for example by the presence of anti-virus 
antibodies in the blood or by virus culture) may serve 
as a suitable lymphocyte donor. Lymphocytes can be 
isolated from a peripheral blood sample or spleen cells 
may be used if the donor is subject to splenectomy. 
Epstein-Barr virus (EBV) can be used to immortalize 
human lymphocytes or a human fusion partner can be used 
to produce human-human hybridomas. Primary in vitro 
immunization with peptides can also be used in the 
generation of human monoclonal antibodies. 

Antibodies secreted by the immortalized cells 
are screened to determine the clones that secrete anti- 
bodies of the desired specificity. For monoclonal 
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anti-virus particle antibodies, the antibodies must 
bind to ET-NANB virus particles. For monoclonal anti- 
idiotype antibodies, the antibodies must bind to anti- 
virus particle antibodies. Cells producing antibodies 
5 of the desired specificity are selected. 

The following examples illustrate various 
aspects of the invention, but are in no way intended to 
limit the scope thereof. 

10 Material 

The materials used in the following Examples 
were as follows: 

Enzymes: DNAse I and alkaline phosphatase were 
obtained from Boehringer Mannheim Biocheraicals (BMB, 
15 Indianapolis, IN); EcoRI, EcoRI methylase, DNA ligase, 
and DNA Polymerase I, from New England Biolabs (NEB, 
Beverly MA); and RNase A was obtained from Sigma (St. 
Louis, MO). 

Other reagents: EcoRI linkers were obtained 

20 from NEB; and nitro blue tetrazolium (NBT) , 
5-bromo-4-chloro-3-indolyl phosphate (BGIP) 
5-bromo-4-chlor o-3-indolyl-B-D-galactopyranoside ( X- 
gal) and isopropyl B-D-thiogalactopyranoside (IPTG) 
were obtained from Sigma. 

25 cDNA synthesis kit and random priming labeling 

kits are available from Boehringer-Mannheim Biochemical 
(BMB, Indianapolis, IN). 

Eample 1 

30 Preparing cDNA Library 

A. Source of ET-NANB virus 

Two cynomolgus monkeys (cynos) were 
intravenously injected with a 10% suspension of a stool 
35 pool obtained from a second-passage cyno (cyno #37) 

infected with a strain of ET-NANB virus isolated from 
Burma cases whose stools were positive for ET-NANB, as 
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evidenced by binding of 27-34 nm virus-like particles 
{VLPs) in the stool to immune serum from a known ET- 
NANB patient. The animals developed elevated levels, of 
alanine aminotransferase (ALT) between 24-36 days after 
5 innoculation, and one excreted 27-34 nm VLPs in its 
bile in the pre-acute phase of infection. 

The bile duct of each infected animal was 
cannulated and about 1-3 cc of bile was collected 
daily, rna was extracted from one bile specimen (cyno 
#121) by hot phenol extraction, using a standard RNA 
isolation procedure. Double-strand cDNA was formed 
from the isolated RNA by a random primer for first- 
strand generation, using a cDNA synthesis kit obtained 
from Boehringer-Mannheim (Indianapolis, IN). 



20 



B. Cloning the Duplex Fragments 

The duplex cDNA fragments were blunt-ended with 
T4 DNA polymerase under standard conditions {Maniatis, 
p. 118), then extracted with phenol/chloroform and 
precipitated with ethanol. The blunt-ended material 
was ligated with EcoRI linkers under standard 
conditions (Maniatis, pp. 396-397) and digested with 
EcoRI to remove redundant linker ends. Non-ligated 
linkers were removed by sequential isopropanol 
25 precipitation. 

Lambda gtlO phage vector (Huynh) was obtained 
from Promega Biotec (Madison, MI). This cloning vector 
has a unique EcoRI cloning site in the phage cl 
repressor gene. The cDNA fragments from above were 
introduced into the EcoRI site by mixing 0.5 - l.o ug 
EcoRI-cleaved gtlO, 0.5-3 yl of the above duplex 
fragments, 0.5 „1 10X ligation buffer, 0.5 pi ligase 
(200 units), and distilled water to 5 „l. The mixture 
was incubated overnight at 14-C, followed by in vitro 
packaging, according to standard methods (Maniatis, pp. 
256-268). 
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The packaged phage were used to infect an 
coli hfl strain, such as strain HG415. Alternatively, 
E « coli, strain C600 hfl, avialable from Promega 
Biotec f Madison, WI, could be used. The percentage of 
5 recombinant plaques obtained with insertion of the 
EcoRI-ended fragments was less than 5% by analysis of 
20 random plaques. 

The resultant cDNA library was plated and phage 
were eluted from the selection plates by addition of 

10 elution buffer. After DNA extraction from the phage, 
the DNA was digested with EcoRI to release the 
heterogeneous insert population, and the DNA fragments 
were fractionated on agarose to remove phage 
fragments. The 500-4,000 basepair inserts were 

15 isolated and recloned into lambda gtlO as above, and 
the packaged phage was used to infect E. coli strain 
HG415. The percentage of successful recombinants was 
greater than 95%. The phage library was plated on E. 
coli strain HG41S, at about 5,000 plaques/plate, on a 

20 total of 8 plates* 
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Example 2 
Selecting ET-NANB Cloned Fragments 
A. cDNA Probes 

Duplex cDNA fragments from noninfected and ET- 
5 NANB-infected cynomolgus monkeys were prepared as in 
Example 1. The cDNA fragments were radiolabeled by 
random priming, using a random-priming labeling kit 
obtained from Boehringer-Mannheim (Indianapolis, IN). 

10 B. Clone Selection 

The plated cDNA library from Example 1 was 
transferred to each of two nitrocellulose filters, and 
the phage DNA was fixed on the filters by baking, 
according to standard methods (Maniatis, pp. 320- 

15 323). The duplicate filters were hybridized with 
either, infected-source or control cDNA probes from 
above. Autoradiographs of the filters were examined to 
identify library clones which hybridized with 
radiolabeled cDNA probes from infected source only, 

20 i.e., did not hybridize with cDNA probes from the non- 
infected source. Sixteen such clones, out of a total 
of about 40,000 clones examined, were identified by 
this subtraction selection method. 

Each of the sixteen clones was picked and 

25 replated at low concentration on an agar plate. The 
clones on each plate were transferred to two nitro- 
cellulose, as duplicate lifts, and examined for hybrid- 
ization to radiolabeled cDNA probes from infected and 
noninfected sources, -as above. Clones were selected 

30 which showed selective binding for infected-source 

probes (i.e., binding with i.nfected-source probes and 
substantially no binding with non-infected-source 
probes). One of the clones which bound selectively to 
probe from infected source was isolated for further 

35 study. The selected vector was identified as lambda 
gtlO-1.1, indicated in Figure 1. 
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Example 3 
ET-NANB Sequence 



10 



15 



20 



25 



30 



Clone lambda gtlO-1.1 from Example 2 was 
digested with EcoRI to release the heterologous insert, 
which was separated from the vector fragments by gel 
electrophoresis- The electrophoretic mobility of the 
fragment was consistent with a 1.33 kb fragment. This 
fragment, which contained EcoRI ends, was inserted into 
the EcoRI site of a pTZ-KFl vector, whose construction 
and properties are described in co-owned U.S. patent 
application for "Cloning Vector System and Method for 
Rare Clone Identification", Serial No. 125, 650, filed 
November 25, 1987. Briefly, and as illustrated in 
Figure 1, this plasmid contains a unique Eco RI site 
adjacent a T7 polymerase promoter site, and plasmid and 
phage origins of replication. The sequence immediately 
adjacent each side of the EcoRI site is known. E. coli 
BB4 bacteria, obtained from Stratagene (La Jolla, CA, 
were transformed with the plasmid. 

Radiolabeled ET-NANB probe was prepared by 
excising the 1.33 kb insert from the lambda gtlO-1.1 
phage in Example 2, separating the fragment by gel 
electrophoresis, and randomly labeling as above. 
Bacteria transfected with the above pTZ-KFl and 
containing the desired ET-NANB insert were selected by 
replica lift and hybridization with the radiolabeled 
ET-NANB probe, according to methods outlined in Example 
2. 

One bacterial colony containing a successsful 
recombinant was used for sequencing a portion of the 
1.33 kb insert. This isolate, designated pTZ- 
KFl(ETl.l), has been deposited with the American Type 
Culture Collection, and is identified by ATCC deposit 
no. 67717. Using a standard dideoxy sequencing 
procedure, and primers for the sequences flanking the 
EcoRI site, about 200-250 basepairs of sequence from 
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the 5 '-end region and 3 '-end region of the insert were 
obtained. The sequences are given above in Section 
II. Later sequencing by the same techniques gave the 
full sequence in both directions, also given above/ 

5 

Example 4 
Detecting ET-NANB Sequences 
cDNA fragment mixtures from the bile of 
noninfected and ET-NANB-infected cynomolgus monkeys 

10 were prepared as above. The cDNA fragments obtained 
from human stool samples were prepared as follows. 
Thirty ml of a 10% stool suspension obtained from an 
individual from Mexico diagnosed as infected with ET- 
NANB as a result of an ET-NANB outbreak, and a similar 

15 volume of stool from a healthy, non-infected 

individual, were layered over a 30% sucrose density 
gradient cushion, and centrifuged at 25,000 xg for 6 hr 
in an SW27 rotor, at.l5°C. The pelleted material from 
the infected-source stool contained 27-34 nm VLP 

20 particles characteristic of ET-NANB infection in the 
infected-stool sample. RNA was isolated from the 
sucrose-gradient pellets in both the infected and non- 
infected samples, and the isolated RNA was used to 
produce cDNA fragments as described in Example 1. 

25 

The cDNA fragment mixtures from infected and 
non-infected bile source, and from infected and non- 
infected human-stool source were each amplified by a 
novel linker/primer replication method described in co- 

30 owned patent application serial number 07/208,512 for 
"DNA Amplification and Subtraction Technique," filed 
June 17, 1988. Briefly, the fragments in each sample 
were blunt-ended with DNA Pol I then extracted with 
phenol/chloroform and precipitated with ethanol. The 

35 blunt-ended material was ligated with linkers having 
the following sequence: 
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5 * -GGAATTCGCGGCCGCTCG-3 1 
3 ' -TTCCTTAAGCGCCGGCGAGC-5 1 

The duplex fragements were digested with Nrul 
5 to remove linker dimers, mixed with a primer having the 
sequence 5 1 -GGAATTCGCGGCCGCTCG-3 1 , and then heat 
denatured and cooled to room temperature to form 
single-strand DNA/primer complexes. The complexes were 
replicated to form duplex fragments by addition of 
10 Thermus aquaticus (Taq) polymerase and all four 
deoxynucleotides. The replication procedures, 
involving successive strand denaturation, formation of 
strand/primer complexes, and replication, was repeated 
25 times. 

15 The amplified cDNA sequences were fractionated 

by agarose gel electrophoresis, using a 2% agarose 
matrix. After transfer of the DNA fragments from the 
agarose gels to nitrocellulose paper, the filters were 
hybridized to a random-labeled 32 P probe prepared by 

20 (i) treating the pTZ-KFl(ETl.l) plasmid from above with 
EcoRI , (ii) isolating the released 1.33 kb ET-NANB 
fragment, and (iii) randomly labeling the isolated 
fragment. The probe hybridization was performed by 
conventional Southern blotting methods ( Mania t is, pp. 

25 382-389). Figure 2 shows the hybridization pattern 

obtained with cDNAs from infected (I) and non-infected 
(N) bile sources (2A) and from infected (I) and non- 
infected (N) human stool sources (2B). As seen, the 
ET-NANB probe hybridized with fragments obtained from 

30 both of the infected sources, but was non-homologous to 
sequences obtained from either of the non-infected 
sources, thus confirming the specificity of derived 
sequence. 

Southern blots of the radiolabeled 1.33 kb 
35 fragment with genomic DNA fragments from both human and 
cynomologus-monkey DNA were also prepared. No probe 
hybridization to either of the genomic fragment 
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mixtures was observed, confirming that the ET-NANB 
sequence is exogenous to either human or cynomolgus 
genome. 

5 Example 5 

Expressing ET-NANB Proteins 

A. Preparing ET-NANB Coding Sequences 

The pTZ-KPl(ETl.l) plasmid from Example 2 was 

10 digested with EcoRI to release the 1.33 kb ET-NANB 

insert which was purified from the linearized plasmid 
by gel electrophoresis. The purified fragment was 
suspended in a standard digest buffer (0.5M Tris HC1, 
pH 7.5; 1 mg/ml BSA; lOmM MnCl 2 ) to a concentration of 

15 about 1 mg/ml and digested with DNAse I at room 
temperature for about 5 minutes. These reaction 
conditions were determined from a prior calibration 
study, in which the incubation time required to produce 
predominantly 100-300 basepair fragments was 

20 determined. The material was extracted with 
phenol/chloroform before ethanol precipitation. 

The fragments in the digest mixture were blunt- 
ended and ligated with EcoRI linkers as in Example 1. 
The resultant fragments were analyzed by 

25 electrophoresis (5-10V/cm) on 1.2% agarose gel, using 
PhiX174/HaeIIl and lambda/Hindlll size markers. The 
100-300 bp fraction was eluted onto NA45 strips 
(Schleicher and Schuell), which were then placed into 
1.5 ml microtubes with eluting solution (1 M NaCl, 50 

30 mH arginine, pH 9.0), and incubated at 67°C for 30-60 
minutes. The eluted DNA was phenol/chloroform 
extracted and then precipitated with two volumes of 
ethanol. The pellet was resuspended in 20 ul TE (0.01 
M Tris HC1, pH 7.5,^0.001 M EDTA) . 

35 ■ * : - . 



10 
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B. Cloning in an Expression Vector 

Lambda gtll phage vector (Huynh) was obtained 
from Promega Biotec (Madison, WI). This cloning vector 
has a unique EcoRI cloning site 53 base pairs upstream 
5 from the beta-galactosidase translation termination 
codon. The genomic fragments from above were 
introduced into the EcoRI site by mixing 0.5-1.0 ug 
EcoRl-cleaved gtll, 0.3-3 ul of the above sized 
fragments, 0.5 ul 10X ligation buffer (above), 0.5 „1 
ligase (200 units), and distilled water to 5 ul. The 
mixture was incubated overnight at 14°C, followed by in 
vitro packaging, according to standard methods 
(Maniatis, pp. 256-268). 

The packaged phage were used to infect e. coli 
15 strain KM392, obtained from Dr. Kevin Moore, DNAX (Palo 
Alto, CA). Alternatively, E. Coli strain Y1090, 
available from the American Type Culture Collection 
(ATCC #37197), could be used. The infected bacteria 
were plated and the resultant colonies were checked for 
loss of beta-galactosidase activity-(clear plaques) in 
the presence of X-gal using a standard X-gal substrate 
plaque assay method (Maniatis). About 50% of the phage 
plaques showed loss of beta-galactosidase enzyme 
activity ( recombinants ) . 

25 

C. Screening for ET-NANB Recombinant Proteins 

ET-NANB convalescent .antiserum was obtained 
from patients infected during documented ET-NANB 
outbreaks in Mexico, Borneo, Pakistan, Somalia, and 
30 Burma. The sera were immunoreactive with VLPs in stool 
specimens from each of several other patients with ET- 
NANB hepatitis. 

A lawn of E. coli KM392 cells infected with 
about 10 4 pfu of the phage stock from above was 
35 prepared on a 150 mm plate and incubated, inverted, for 
5-8 hours at 37°C. The lawn was overlaid with a 
nitrocellulose sheet, causing transfer of expressed ET- 



20 
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NANB recombinant protein from the plaques to the 
paper. The plate and filter were indexed for matching 
corresponding plate and filter positions. 

The filter was washed twice in TBST buffer (10 
5 mH Tris, pH 8.0, 105 mH NaCl, 0.05% Tween 20), blocked 
with AIB (TBST buffer with 1% gelatin), washed again in 
TBST, and incubated overnight after addition of 
antiserum (diluted to 1:50 in AIB, 12-15 ml/plate). 
The sheet was washed twice in TBST and then contacted 

10 with enzyme-labeled anti-human antibody to attach the 
labeled antibody at filter sites containing antigen 
recognized by the antiserum. After a final washing, 
the filter was developed in a substrate medium 
containing 33 ul NBT (50 mg/ml stock solution 

15 maintained at 5°C) mixed with 16 ul BCIP (50 mg/ml 

stock solution maintained at 5°C) in 5 ml of alkaline 
phosphatase buffer (100 mM Tris, 9.5, 100 mH NaCl, 5 mM 
MgCl 2 ). Purple color appeared at points of antigen 
production, as recognized by the antiserum. 

20 

D. Screening Plating 

The areas of antigen production determined in 
the previous step were replated at about 100-200 pfu on 
an 82 mm plate. The above steps, beginning with a 5-8 
25 hour incubation, through NBT-BCIP development, were 
repeated in order to plaque purify phage secreting an 
antigen capable of reacting with the ET-NANB 
antibody. The identified plaques were picked and 
eluted in phage buffer (Maniatis, p. 443). 

30 

E. Epitope Identification 

A series of subclones derived from the original 
pTZ-KPl (ET1.1) plasmid from Example 2 were isolated 
using the same, techniques described above. Each of 
35 these five subclones were immunoreactive with a pool of 
anti-ET antisera noted in C. The subclones contained 
short sequences from the "reverse" sequence set forth 
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previously. The beginning and ending points of the 
sequences in the subclones (relative to the full 
"reverse" sequence), are identified in the table 
below. 

5 



TABLE 1 

Subclone Position in "Reverse" Sequence 



10 5 '-end 3 ' -end 

XI 522 643 

Y2 594 667 

Y3 508 665 

Y4 558 752 

15 Y5 545 665 



Since all of the gene sequences identified in 
the table must contain the coding sequence for the 

20 epitope, it is apparent that the coding sequence for 
the epitope falls in the region between nucleotide 594 
(5 '-end) and 643 (3* -end). Genetic sequences 
equivalent to and complementary to this relatively 
short sequence are therefore particularly preferred 

25 aspects of the present invention, as are peptides 
produced using this coding region. 

A second series of clones identifying an 
altogether different epitope was isolated with only 
Mexican serum. 

30 
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10 



TABLE 2 

Subclone Position in "Forward" Sequence 

5"end 3' end 

£ I'l 2 193 

135 



ET 8-3 2 
ET 9-1 2 
ET 13.-1 2 



109 
101 



The coding system for this epitope falls 

between nucleotide 2. (5'-end) and 101 (3'-end). 

Genetic sequences related to this short sequence are 
15 therefore also preferred, as are peptides produced 

using this coding region. 

While the invention has been described with 

reference to particular embodiments, methods, 

construction and use, it will be apparent to those 
2 0 skilled in the art that various changes and 

modifications can be made without departing from the 

invention. 



25 
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WE CLAIM: 

1. A protein derived from an enter ically transmitted 
non-A/non-B viral hepatitis agent whose genome 
5 contains a region which is homologous to the 1.33 

kb DMA EcoRI insert present in plasmid pTZ- 
KFl(ETl.l) carried in E. coli strain BB4 and having 
ATCC deposit no, 67717. 

10 2. The protein of claim 1, which is encoded by a 
coding region within said 1.33 kb Eco RI insert. 

3. A recombinant protein derived from an enterically 
transmitted nonA/nonB viral hepatitis agent whose 
15 genome contains a region which is homologous to a 

duplex DNA having a first sequence: 

1 11 21 31 41 51 

* * * * * * 

20 CGAATTCCGCCAACTGATGGAAGGCACTAATCTGGCAAGACCTGTCCCTGTTGCAGCTGT 

fil 71 81 91 101 111 

* * * * * * 



25 



TCTACCACCCTGCCCCGAGCTCGAACAGGGCCTTCTCTACCTGCCCCAGGAGCTCACCAC 

121 131 141 151 161 171 

* * . * * * * 

CTGTGATAGTGTCGTAACATTTGAATTAACAGACATTGTGCACTGCCGCATGGCCGCCCC 



30 
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181 "I 201 211 221 231 

* *. * * 

GAGCCAGCGCAAGGCCGTGCTGTCCACACTCGTGGGCCGCTACGGCGTCGCACAAAGCTC 

5 24 1 251 261 271 281 291 

***** 

TACAATGCTTCCCACTCTGATGTTCGCGACTCTCTCGCCCGTTTTATCCCGGCCATTGGC 

301 311 321 331 341 351 

10 * * * * * 

CCCGTACAGGTTACAACTTGTGAATTGTACGAGCTAGTGGAGGCCATGGTCGAGAAGGGC 

36i 371 381 391 401 411 

* * * . * * * 

15 CAGGATGGCTCCGCCGTCCTTGAGCTTGATCTTTGCAACCGTGACGTGTCCAGGATCACC 

42i 431 441 451 461 ' 471 

* * * * . * * 

TTCTTCCAGAAAGATTGTAACAAGTTCACCACAGGTGAGACCATTGCCCATGGTAAAGTG 

20 

48i 491 501 511 521 531 



* 



* 



GGCCAGGGCATCTCGGCCTGGAGCAAGACCTTCTGCGCCCTCTTTGGCCCTTGGTTCCGC 

25 541 551 561 571 581 591 

****** 
GCTATTGAGAAGGCTATTCTGGCCCKK^CCCTCAGGGTGTGTTTTACGGTGATGCCTCT 

6 °1 611 621 631 641 651 

30 * * * * 

GATGACACCGTCTTCTCGGCGGCTGTGGCCGCAGCAAAGGQWPCCATGGTGTTTGAGAAT 

661 671 681 691 701 711 

* • 

3 5 GACTTTTCTGAGTTTGACTCCACCCAGAATAACTTTTCTCTGGGTCTAGAGTGTGCTATT 
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721 731 741 751 761 771 

* * * * * * 

ATGGAGGAGTGTGGGATGCCGCAGTGGCTCATCCGCCTGTATCACCTTATAAGGTCTGCG 

5 781 791 801 811 821 831 

****** 

TGGATCTTGCAGGCCCCGAAGGAGTCTCTGCGAGGGTTTTGGAAGAAACACTCCGGTGAG 

841 851 861 871 881 891 

10 * * * * * * 

CCCGGCACTCTTCTATGGAATACTGTCTGGAATATGGCCGTTATTACCCACTGTTATGAC 

901 911 921 931 941 951 

* * * * * * 

15 TTCCGCGATTTTCAGGTGGCTGCCTTTAAAGGTGATGATTCGATAGTGCTTTGCAGTGAG 

961 971 981 991 1001 1011 

* ***** 

TATCGTCAGAGTCCAGGAGCTGCTGTCCTGATCGCCGGCTGTGGCTTGAAGTTGAAGGTA 

20 

1021 1031 1041 1051 1061 1071 

* ***** 

GATTTCCGCCCGATCGGTTTGTATGCAGGTGTTGTGGTGGCCCCCGGCCTTGGCGCGCTC 

25 1081 1091 1101 1111 1121 1131 

* ***** 

CCTGATGTTGTGCGCTTCGCCGGCCGGCTTACCGAGAAGAATTGGGGCCCTGGCCCTGAG 

1141 1151 1161 1171 1181 1191 

30 * * * * * * 

CGGGCGGAGCAGCTCCGCCTCGCTGTTAGTGATTTCCTCCGCAAGCTCACGAATGTAGCT 

1201 1211 1221 1231 1241 1251 

* ***** 

35 CAGATGTGTGTGGATGTTGTTTCCCGTGTTTATGGGGTTTCCCCTGGACTCGTTCATAAC 
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1261 1271 1281 1291 1301 1311 

* * * * * * 
CTGATTGGCATGCTACAGGCTGTTGCTGATGGCAAGGCACATTTCACTGAGTCAGTAAAA 

5 1321 1331 1341 

* * * 

CCAGTGCTCGACCGGAATTCGAGC , 
a second sequence 

10 

1 11 . 21 31 41 51 

* * * * * * 

GCTCGAATTCCGGTCGAGCACTGGTTTTACTGACTCAGTGAAATGTGCCTTGCCATCAGC 

15 61 71 81 91 ioi in 

* * * * * * 

AACAGCCTGTAGCATGCCAATCAGGTTATGAACGAGTCCAGGGGAAACCCCATAAACACG 

1^1 131 141 151 161 171 

20 * * * * * * 

GGAAACAACATCCACACACATCTGAGCTACATTCGTGAGCTTGCGGAGGAAATCACTAAC 

181 191 201 211 221 231 

****** 
25 AGCGAGGCGGAGCTGCTCCGCCCGCTCAGGGCCAGGGCCCCAATTCTTCTCGGTAAGCCG 

•241 251 261 271 281 291 

****** 

GCCGGCGAAGCGCACAACATCAGGGAGCGCGCCAAGGCCGGGGGCCACCACAACACCTGC 

30 

301 311 321 331 341 351 

****** 

ATACAAACCGATCGGGCGGAAATCTACCTTCAACTTCAAGCCACAGCCGGCGATCAGGAC 



35 361 371 381 391 401 411 

* * * * 

AGCAGCTCCTGGACTCTGACGATACTCACTGCAAAGCACTATCGAATCATCACCTTTAAA 
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421 431 441 451 461 471 

* * * * * * 

GGCAGCCACCTGAAAATCGCGGAAGTCATAACAGTGGGTAATAACGGCCATATTCCAGAC 



5 



481 491 501 511 521 531 

****** 

AGTATTCCATAGAAGAGTGCCGGGCTCACCGGAGTGTTTCTTCCAAAACCCTCGCAGAGA 

10 541 551 561 571 581 591 

***** * 

CTCCTTCGGGGCCTGCAAGATCCACGCAGACCTTATAAGGTGATACAGGCGGATGAGCCA 

601 611 621 631 641 651 

15 * * * * * * 

CTGCGGCATCCCACACTCCTCCATAATAGCACACTCTAGACCCAGAGAAAAGTTATTCTG 

661 671 681 691 701 711 

****** 

2 0 GGTGGAGTCAAACTCAGAAAAGTCATTCTCAAACACCATGGATGCCTTTGCTGCGGCCAC 

721 731 741 751 761 771 

****** 

AGCCGCCGAGAAGACGGTGTCATCAAAGGCATCACCGTAAAACACACCCTGAGGGAGCAG 



25 



781 791 801 811 821 831 

***** * 

GGCCAGAATAGCCTTCTCAATAGCGCGGAACCAAGGGCCAAAGAGGGCGCAGAAGGTCTT 



30 841 851 861 871 881 891 

* * * * * * 

GCTCCAGGCCGAGATGCCCTGGCCCACTTTACCATGGGCAATGGTCTCACCTGTGGTGAA 



901 911 921 931 941 951 

35 * * * * * * 

CTTGTTACAATCTTTCTGGAAGAAGGTGATCCTGGACACGTCACGGTTGCAAAGATCAAG 



15 



20 



30 



35 
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961 971 981 991 1001 1011 

CTCAAGGACGGCGGAGCCATCCTGGCCCTTCTCGACCATGGCCTCCACTAGCTCGTACAA 

5 1021 1031 1041 1051 1061 1071 

* * * 
TTCACAAGTTGTAACCTGTACGGGGCCAATGGCCGGGATAAAACGGGCGAGAGAGTCGCG 

1081 1091 1101 mi 1121 U31 

10 * * * * 

AACATCAGAGTGGGAAGCATTGTAGAGCTTTGTGCGACGCCGTAGCGGCCCACGAGTGTG 

1141 1151 1161 1171 n 81 119i 

* * * 
GACAGCACGGCCTTGCGCTGGCTCGGGGCGGCCATGCGGCAGTGCACAATGTCTGTTAAT 



1201 1211 1221 1231 1241 1251 

* * * * 

TCAAATGTTACGACACTATCACAGGTGGTGAGCTCCTGGGGCAGGTAGAGAAGGCCCTGT 



1261 1271 1281 1291 



1301 1311 



* 



* * * * * 
TCGAGCTCGGGGCAGGGTGGTAGAACAGCTGCAACAGGGACAGGTCTTGCCAGATT- 

25 1321 1331 1341 

* * * 

CCTTCCATCAGTTGGCGGAATTCG , 



AGTG 



a third sequence 

1 CGGTTGTTCA GTACCAGTTT ACTGCAGGTG TGCCTGGATC CGGCAAGTCC 

51 CGCTCTATCA CCCAAGCCGA TGTGGACGTT GTCGTGGTCC CGACGCGTGA 

101 GTTGCGTAAT GCCTGGCGCC GTCGCGGCTT TGCTGCTTTT ACCCCGCATA 

151 CTGCCGCCAG AGTCACCCAG GGGCGCCGGG TTGTCATTGA TGAGGCTCCA 

201 TCCCTCCCCC CTCACCTGCT GCTGCTCCAC ATGCAGCGGG CCGCCACCGT 

251 CCACCTTCTT GGCGCCCCGA ACCAGATCCC AGCCATCGAC TTTGAGCACG 

301 CTGGGCTCGT CCCCGCCATC AGGCCCGACT TAGCCCCACC TCCTGGTGGC 
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351 ATGTTACCCA TCGCTGCCCT GCGGATGTAT GCGAGCTCAT CCGTGGTGCA 
401 TACCCCATGA TCCAGACCAC TAGCCGGGTT CTCCGTTCGT TGTTCTGGGG 
451 TGAGCCTGCC GTCGGGCAGA AACTAGTGTT CACCCAGGCG GCCAAGGCCG 
501 CCAACCCCGG CTCAGTGACG GTCCACGAGG CGCAGGGCGC TACCTACACG 
5 551 GAGACCACTA TTATTGCCAC AGCAGATGCC CGGGGCCTTA TTCAGTCGTC 

601 TCGGGCTCAT GCCATTGTTG CTCTGACGCG CCACACTGAG AAGTGCGTCA 
651 TCATTGACGC ACCAGGCCTG CTTCGCGAGG TGGGCATCTC CGATGCAATC 
701 GTTAATAACT TTTTCCTCGC TGGTGGCGAA ATTGGTCACC AGCGCCCATC 
751 AGTTATTCCC CGTGGCAACC CTGACGCCAA TGTTGACACC CTGGCTGCCT 
10 801 TCCCGCCGTC TTGCCAGATT AGTGCCTTCC ATCAGTTGGC TGAGGAGCTT 

851 GGCCACAGAC CTGTCCCTGT TGCAGCTGTT CTACCACCCT GCCCCGAGCT 
901 CGAACAGGGC CTTCTCTACC TGCCCCAGGA GCTCACCACC TGTGATAGTG 
951 TCGTAACATT TGAATTAACA GACATTGTGC ACTGCCGCAT GGCCGCCCCG 
1001 AGCCAGCGCA AGGCCGTGCT GTCCACACTC GTGGGCCGCT ACGGCGTCGC 
15 1051 ACAAAGCTCT ACAATGCTTC CCACTCTGAT GTTCGCGACT CTCTCGCCCG 

1101 TTTTATCCCG GCCATTGGCC CCGTACAGGT TACAACTTGT GAATTGTACG 
1151 AGCTAGTGGA GGCCATGGTC GAGAAGGGCC AGGATGGCTC CGCCGTCCTT 
1201 GAGCTTGATC TTTGCAACCG TGACGTGTCC AGGATCACCT TCTTCCAGAA 
1251 AGATTGTAAC AAGTTCACCA CAGGTGAGAC CATTGCCCAT GGTAAAGTGG 
20 1301 GCCAGGGCAT CTCGGCCTGG AGCAAGACCT TCTGCGCCCT CTTTGGCCCT 

1351 TGGTTCCGCG CTATTGAGAA GGCTATTCTG GCCCTGCTCC CTCAGGGTGT 
1401 GTTTTACGGT GATGCCTTTG ATGACACCGT CTTCTCGGCG GCTGTGGCCG 
1451 CAGCAAAGGC ATCCATGGTG TTTGAGAATG ACTTTTCTGA GTTTGACTCC 
1501 ACCCAGAATA ACTTTTCTCT GGGTCTAGAG TGTGCTATTA TGGAGGAGTG 
25 1551 TGGGATGCCG CAGTGGCTCA TCCGCCTGTA TCACCTTATA AGGTCTGCGT 

1601 GGATCTTGCA GGCCCCGAAG GAGTCTCTGC GAGGGTTTTG GAAGAAACAC 
1651 TCCGGTGAGC CCGGCACTCT TCTATGGAAT ACTGTCTGGA ATATGGCCGT 
1701 TATTACCCAC TGTTATGACT TCCGCGATTT TCAGGTGGCT GCCTTTAAAG 
1751 GTGATGATTC GATAGTGCTT TGCAGTGAGT ATCGTCAGAG TCCAGGAGCT 
30 1801 GCTGTCCTGA TCGCCGGCTG TGGCTTGAAG TTGAAGGTAG ATTTCCGCCC 

1851 GATCGGTTTG TATGCAGGTG TTGTGGTGGC CCCCGGCCTT GGCGCGCTCC 
1901 CTGATGTTGT GCGCTTCGCC GGCCGGCTTA CCGAGAAGAA TTGGGGCCCT 
1951 GGCCCTGAGC GGGCGGAGCA GCTCCGCCTC GCTGTTAGTG ATTTCCTCCG 
2001 CAAGCTCACG AATGTAGCTC AGATGTGTGT GGATGTTGTT TCCCGTGTTT 
35 2051 ATGGGGTTTC CCCTGGACTC GTTCATAACC TGATTGGCAT GCTACAGGCT 

2101 GTTGCTGATG GCAAGGCACA TTTCACTGAG TCAGTAAAAC CAGTGCTCGA 
2151 CTTGACAAAT TCAATCTTGT GTCGGGTGGA ATGAATAACA TGTCTTTTGC 
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2201 TGCGCCCATG GGTTCGCGAC CATGCGCCCT CGGCCTATTT TGTTGCTGCT 
2251 CCTCATGTTT TTGCCTATGC TGCCCGCGCC ACCGCCCGGT CAGCCGTCTG 
2301 GCCGCCGTCG TGGGCGGCGC AGCGGCGGTT CCGGCGGTGG TTTCTGGGGT 
2351 GACCGGGTTG ATTCTCAGCC CTTCGCAATC CCCTATATTC ATCCAACCAA 
5 2401 CCCCTTCGCC CCCGATGTCA CCGCTGCGGC CGGGGCTGGA CCTCGTGTTC 

2451 GCCAACCCGC CCGACCACTC GCGTCCGCTT GGCGTGACCA GGCCCAGCGC 
2501 CCCGCCGTTG CCTCACGTCG TAGACCTACC ACAGCTGGGG CCGCGCCGCT 
2551 AACCGCGGTC GCTCCGGCCC CG, 

10 or a fourth sequence 

1 GCGGCCGCTC GGTGGGGTTC CACCTCTCTT CCCATTCCGA ACAGAGAAGT 
51 TAAGCAAGGA ATTAATTCGC GGCCGCTCGT GTTGCGTGAG GTGGGTATCT 
101 CAGATGCCAT TGTTAATAAT TTCTTCCTTT CGGGTGGCGA GGTTGGTCAC 
15 151 CAGAGACCAT CGGTCATTCC GCGAGCAACC CTGACCGCAA TGTTGACGTG 

201 CTTGCGGCGT TTCCACCTTC ATGCCAAATA AGCGCCTTCC ATCAGCTTGC 
250 TGAGGAGCTG GGCCACCGGC GGCGCCGGTG CTGTGCTACC TCCCTGCCCT 
301 GAGCTTGAGC AGGGCCTTCT CTATCTGCCA CAGGAGCTAG CCTCCTGTGA 
350 CAGTGTTGTG ACATTTGAGC TAACTGACAT TGTGCACTGC CGCATGGCGG 
20 401 CCCCTAGCCA AAGGAAAGCT GTTTTGTCCA CGCTGGTAGG CCGGTATGGC 

451 AGACGCACAA GGCTTTATGA TGCGGGTCAC ACCGATGTCC GCGCCTCCCT 
501 TGCGCGCTTT ATTCCCACTC TCGGGCGGGT TACTGCCACC ACCTGTGAAC 
551 TCTTTGAGCT TGTAGAGGCG ATGGTGGAGA AGGGCCAAGA CGGTTCAGCC 
601 GTCCTCGAGT TGGATTTGTG CAGCCGAGAT GTCTCCCGCA TAACCTTTTT 
25 651 CCAGAAGGAT TGTAACAAGT TCACGACCGG CGAGACAATT GCGCATGGCA 

701 AAGTCGGTCA GGGTATCTCC GCCTGGAGTA AGACCTTTTG TGCCCTGTTT 
751 GGCCCCTGGT TCCGTTGCGA TTGAGAAGGC TATTCTATCC CMTTACCAC 
801 AAGCTGTGTT CTACGGGGAT GCTTATGACG ACTCAGTATT CTCTGCTGCC 
851 GTGGCTGGCG CCAGCCATGC CATGGTGTTT GAAAATGATT TTTCTGAGTT 
30 901 TGACTCGACT CAGAATAACT TTTCCCTAGG TCTTGAGTGC GCCATTATGG 

951 AAGAGTGTGG TATGCCCCAG TGGCTTGTCA GGTTGTACCA TGCCGTCCGG 
1001 TCGGCGTGGA TCCTGCAGGC CCCAAAAGAG TCTTTGAGAG GGTTCTGGAA 
1051 GAAGCATTCT GGTGAGCCGG GCACGTTGCT CTGGAATACG GTGTGGAACA 
1101 TGGCAATCAT TGCCCATTGC TATGAGTTCC GGGACCTCCA GGTTGCCGCC 
35 1151 TTCAAGGGCG ACGACTCGGT CGTCCTCTGT AGTGAATACC GCCAGAGCCC 

1201 AGGCGCCGGT TCGCTTATAG CAGGCTGTGG TTTGAAGCCG AAGGCTGACT 
1251 TCCGGCCGAT TGGGCTGTAT GCCGGGGTTG TCGTCGCCCC GGGGCTCGGG 
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1301 GCCCTACCCG ATGTCGTTCG ATTCGCCGGA CGGCTTTCGG AGAAGAACTG 
1351 GGGGCCTGAT CCGGAGCGGG CAGAGCAGCT CCGCCTCGCC GTGCAGGATT 
1401 TCCTCCGTAG GTTAACGAAT GTGGCCCAGA TTTGTGTTGA GGTGGTGTCT 
1451 AGAGTTTACG GGGTTTCCCC GGGTCTGGTT CATAACCTGA TAGGCATGCT 
1501 CCAGACTATT GGTGATGGTA AGGCGCATTT TACAGAGTCT GTTAAGCCTA 
1551 TACTTGACCT TACACACTCA ATTATGCACC GGTCTGAATG AATAACATGT 
1601 GGTTTCCTGC GCCCATGGGT TCGCCACCAT GCGCCCTAGG CCTCTTTTGC 
1651 CGAGCGGCCG C, 

or a sequence complementary thereto. 

4. A protein which is (a) immunoreactive with 
antibodies present in individuals infected with * 
enter ically transmitted nonA/nonB and (b) derived 
from a viral hepatitis agent whose genome contains 
a region which is homologous to the 1.33 kb DNA 
EcoRI insert present in plasmid pTZ-KFl(ETl.l) 
carried in E. coli strain BB4, and having ATCC 
deposit no. 67717. 

5. The protein of claim 4, which is encoded by a 
coding region within said 1.33 kb* EcoRI insert. 

6. A method of detecting infection by enter ically 
transmitted nonA/nonB hepatitis viral agent in a 
test individual, comprising: 

providing a peptide antigen which is (a) 
immunoreactive with antibodies present in 
individuals infected with enterically transmitted 
nonA/nonB hepatitis and (b) derived from a viral 
hepatitis agent whose genome contains a region 
which is homlogous to the 1.33 kb DNA EcoRI insert 
present in plasmid pTZ-KFl(ETl.l) carried in E^ 
coli strain BB4 r and having ATCC deposit no. 67717, 

reacting serum from the test individual 
with such antigen, and 



examining the antigen for the presence of 
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bound antibody, 

7. The method of claim 6, wherein the serum antibody 
is an IgM or IgG antibody, or a mixture of both, 

5 the antigen. provided is attached to a support, said 

reacting includes contacting such serum with the 
support and said examining includes reacting the 
support and bound serum antibody with a reporter- 
labeled anti-human antibody. 

10 

8. A kit for ascertaining the presence of serum 
antibodies which are diagnostic of enterically 
transmitted nonA/nonB hepatitis infection, 
comprising 

15 a support with surface-bound recombinant 

peptide antigen which is (a) immunoreactive with 
antibodies present in individuals infected with 
enterically transmitted nonA/nonB viral hepatitis 
agent and (b) derived from a viral hepatitis agent 
whose genome contains a region which is homologous 
to the 1.33 kb DNA EcoRI insert present in plasmid 
pTZ-KFl(ETl.l) carried in E. coli strain BB4, and 
having ATCC deposit no. 67717, 

a reporter-labeled anti-human antibody. 
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9. A DNA fragment derived from an enterically 
transmitted nonA/nonB viral hepatitis agent whose 
genome contains a region which is homologous to the 
1.33 kb DNA EcoRI insert present in plasmid pTZ- 
KFl(ETl.l) carried in E. coli strain BB4 and having 
ATCC deposit no. 67717. 

10. The fragment of claim 9, which is derived from said 
1.33 kb Eco RI insert. 

11. A DNA fragment derived from an enterically 
transmitted nonA/nonB viral hepatitis agent whose 
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genome contains a region which is homologous to a 
duplex DNA fragment within a first sequence: 
1 11 21 31 41 51 

* ***** 

CGAATTCCGCCAACTGATGGAAGGCACTAATCTGGCAAGACCTGTCCCTGTTGCAGCTGT 

61 71 81 91 101 111 

****** 

TCTACCACCCTGCCCCGAGCTCGAACAGGGCCTTCTCTACCTGCCCCAGGAGCTCACCAC 



10 



121 131 141 151 161 171 

* ***** 

CTGTGATAGTGTCGTAACATTTGAATTAACAGACATTGTGCACTGCCGCATGGCCGCCCC 

15 181 191 201 211 221 231 

* ***** 

GAGCCAGCGCAAGGCCGTGCTGTCCACACTCGTGGGCCGCTACGGCGTCGCACAAAGCTC 

241 251 261 271 281 291 

20 * * * * * * 

TACAATGCTTCCCACTCTGATGTTCGCGACTCTCTCGCCCGTTTTATCCCGGCCATTGGC 



301 311 321 331 341 351 

25 * * * * * * 

CCCGTACAGGTTACAACTTGTGAATTGTACGAGCTAGTGGAGGCCATGGTCGAGAAGGGC 

361 371 381 391 401 411 

* ***** 

3 0 CAGGATGGCTCCGCCGTCCTTGAGCTTGATCTTTGCAACCGTGACGTGTCCAGGATCACC 

421 431 441 451 461 471 

* * * * * * 

TTCTTCCAGAAAGATTGTAACAAGTTCACCACAGGTGAGACCATTGCCCATGGTAAAGTG 



35 



481 491 501 511 521 531 
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GGCCAGGGCATCTCGGCCTGGAGCAAGACCTTCTGCGCCCTCTTTGGCCCTTGGTTCCGC 

541 551 561 571 581 591 

***** 
5 GCTATTGAGAAGGCTATTCTGGCCCTGCTCCCTCAGGGTGTGTTTTACGGTGATGCCTTT 

601 611 621 631 641 651 

* * * * * 

GATGACACCGTCTTCTCGGCGGCTGTGGCCGCAGCAAAGGCATCCATGGTGTTTGAGAAT 

10 

661 671 681 691 • 701 711 

* * * * * 

GACTTTTCTGAGTTTGACTCCACCCAGAATAACTTTTCTCTGGGTCTAGAGTGTGCTATT 

15 721 731 741 751 761 771 

* * * * * * 
ATGGAGGAGTGTGGGATGCCGCAGTGGCTCATCCGCCTGTATCACCTTATAAGGTCTGCG 

781 • 791 801 811 821 831 

20 * * * * * # 

.TGGATCTTGCAGGCCCCGAAGGAGTCTCTGCGAGGGTTTTGGAAGAAACACTCCGGTGAG 

841 851 861 871 881 891 

* * * * * 

25 CCCGGCACTCTTCTATGGAATACTGTCTGGAATATGGCCGTTATTACCCACTGTTATGAC 



901 911 921 931 941 951 

* * * * * . * 

TTCCGCGATTTTCAGGTGGCTGCCTTTAAAGGTGATGATTCGATAGTGCTTTGCAGTGAG 

30 

961 971 981 991 1001 1011 

***** 

TATCGTCAGAGTCCAGGAGCTGCTGTCCTGATCGCCGGCTGTGGCTTGAAGTTGAAGGTA 

35 1021 1031 1041 1051 1061 1071 

* * * - : SG * * * 
GATTTCCGCCCGATCGGTTTGTATGCAGGTGTTGTGGTGGCCCCCGGCCTTGGCGCGCTC 
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1081 1091 lioi mi U21 1131 

****** 



5 



CCTGATGTTGTGCGCTTCGCCGGCCGGCTTACCGAGAAGAATTGGGGCCCTGGCCCTGAG 
1141 1151 1161 1171 H81 i 191 

CGGGCGGAGCAGCTCCGCCTCGCTGTTAGTGATTTCCTCCGCAAGCTCACGAATGTAGCT 
10 1201 1211 1221 1231 1241 1251 

CAGATGTGTGTGGATGTTGTTTCCCGTGTTTATGGGGTTTCCCCTGGACTCGTTCATAAC 

1261 1271 1281 1291 1301 1311 

15 * 

CTGATTGGCATGCTACAGGCTGTTGCTGATGGCAAGGCACATTTCACTGAGTCAGTAAAA 

1321 1331 1341 

* * * 

20 CCAGTGCTCGACCGGAATTCGAGC , 
a second sequence 

1 n 21 31 41 si 

25 * 

GCTCGAATTCCGGTCGAGCACTGGTTTTACTGACTCAGTGAAATGTGCCTTGCCATCAGC 



30 



61 71 81 91 101 in 

* * * 

AACAGCCTGTAGCATGCCAATCAGGTTATGAACGAGTCCAGGGGAAACCCCATAAACACG 



35 



m 131 141 151 161 171 

* * * * * 

* * * 

GGAAACAACATCCACACACATCTGAGCTACATTCGTGAGCTTGCGGAGGAAATCACTAAC 



181 191 201 211 



221 231 



WO 89/12462 PCT/US89/02648 

68 

AGCGAGGCGGAGCTGCTCCGCCCGCTCAGGGCCAGGGCCCCAATTCTTCTCGGTAAGCCG 

241 251 261 271 281 291 

* * * * * * 

5 GCCGGCGAAGCGCACAACATCAGGGAGCGCGCCAAGGCCGGGGGCCACCACAACACCTGC 

301 311 321 331 341 351 

* * * * * * 

ATACAAACCGATCGGGCGGAAATCTACCTTCAACTTCAAGCCACAGCCGGCGATCAGGAC 

10 

361 371 381 391 - 401 411 

* * * * * * 

AGCAGCTCCTGGACTCTGACGATACTCACTGCAAAGCACTATCGAATCATCACCTTTAAA 

15 421 431 441 451 461 471 

****** 

GGCAGCCACCTGAAAATCGCGGAAGTCATAACAGTGGGTAATAACGGCCATATTCCAGAC 

481 491 501 511 521 531 

20 * * * * * * 

AGTATTCCATAGAAGAGTGCCGGGCTCACCGGAGTGTTTCTTCCAAAACCCTCGCAGAGA 



25 



30 



35 



WO 89/12462 PCI7US89/02648 

69 

541 551 561 571 581 591 

* ***** 

CTCCTTCGGGGCCTGCAAGATCCACGCAGACCTTATAAGGTGATACAGGCGGATGAGCCA 

5 601 611 621 631 641 651 

* ***** 

CTGCGGCATCCCACACTCCTCCATAATAGCACACTCTAGACCCAGAGAAAAGTTATTCTG 

661 671 681 691 701 711 

10 * * * * * * 

GGTGGAGTCAAACTCAGAAAAGTCATTCTCAAACACCATGGATGCCTTTGCTGCGGCCAC 

721 731 741 751 761 771 

* * * * * * 

AGCCGCCGAGAAGACGGTGTCATCAAAGGCATCACCGTAAAACACACCCTGAGGGAGCAG 



15 



20 



781 791 801 811 821 831 

* * * * * * 

GGCCAGAATAGCCTTCTCAATAGCGCGGAACCAAGGGCCAAAGAGGGCGCAGAAGGTCTT 

841 851 861 871 881 891 

* ***** 

GCTCCAGGCCGAGATGCCCTGGCCCACTTTACCATGGGCAATGGTCTCACCTGTGGTGAA 

25 901 911 921 931 941 951 

* ***** 

CTTGTTACAATCTTTCTGGAAGAAGGTGATCCTGGACACGTCACGGTTGCAAAGATCAAG 

961 971 981 991 1001 1011 

30 * * * * * * 

CTCAAGGACGGCGGAGCCATCCTGGCCCTTCTCGACCATGGCCTCCACTAGCTCGTACAA 

1021 1031 1041 1051 1061 1071 

****** 
3 5 TTCACAAGTTGTAACCTGTACGGGGCCAATGGCCGGGATAAAACGGGCGAGAGAGTCGCG 
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1081 1091 1101 1111 H21 



1131 



* * 



AACATCAGAGTGGGAAGCATTGTAGAGCTTTGTGCGACGCCGTAGCGGCCCACGAGTGTG 

5 1141 1151 1161 . 1171 1181 H91 

* ***** 

GACAGCACGGCCTTGCGCTGGCTCGGGGCGGCCATGCGGCAGTGCACAATGTCTGTTAAT 

1201 1211 1221 1231 1241 1251 

10 * * * * * , 

TCAAATGTTACGACACTATCACAGGTGGTGAGCTCCTGGGGCAGGTAGAGAAGGCCCTGT 

1261 1271 1281 . 1291 1301 13n 

* * . * * * * 

15 TCGAGCTCGGGGCAGGGTGGTAGAACAGCTGCAACAGGGACAGGTCTTGCCAGATTAGTG 

1321 1331 1341 

* * * 

CCTTCCATCAGTTGGCGGAATTCG , 

20 

a third sequence 





1 


CGGTTGTTCA 


GTACCAGTTT 




51 


CGCTCTATCA 


CCCAAGCCGA 


25 


101 


GTTGCGTAAT 


GCCTGGCGCC 




151 


CTGCCGCCAG 


AGTCACCCAG 




201 


TCCCTCCCCC 


CTCACCTGCT 




251 


CCACCTTCTT 


GGCGCCCCGA 




301 


CTGGGCTCGT 


CCCCGCCATC 


30 


351 


ATGTTACCCA 


TCGCTGCCCT 




401 


TACCCCATGA 


TCCAGACCAC 




451 


TGAGCCTGCC 


GTCGGGCAGA 




501 


CCAACCCCGG 


CTCAGTGACG 




551. 


GAGACCACTA 


TTATTGCCAC 


35 


601 


TCGGGCTCAT 


GCCATTGTTG 




651 


TCAXTGACGC 


ACCAGGCCTG 




701 


GTTAATAACT 


TOTTCCTCGC 



WO 89/1 2462 PCIV US89/02648 

71 

751 AGTTATTCCC CGTGGCAACC CTGACGCCAA TGTTGACACC CTGGCTGCCT 
801 TCCCGCCGTC TTGCCAGATT AGTGCCTTCC ATCAGTTGGC TGAGGAGCTT 
851 GGCCACAGAC CTGTCCCTGT TGCAGCTGTT CTACCACCCT GCCCCGAGCT 
901 CGAACAGGGC CTTCTCTACC TGCCCCAGGA GCTCACCACC TGTGATAGTG 
5 951 TCGTAACATT TGAATTAACA GACATTGTGC ACTGCCGCAT GGCCGCCCCG 

1001 AGCCAGCGCA AGGCCGTGCT GTCCACACTC GTGGGCCGCT ACGGCGTCGC 
1051 ACAAAGCTCT ACAATGCTTC CCACTCTGAT GTTCGCGACT CTCTCGCCCG 
1101 TTTTATCCCG GCCATTGGCC CCGTACAGGT TACAACTTGT GAATTGTACG 
1151 AGCTAGTGGA GGCCATGGTC GAGAAGGGCC AGGATGGCTC CGCCGTCCTT 
10 1201 GAGCTTGATC TTTGCAACCG TGACGTGTCC AGGATCACCT TCTTCCAGAA 

1251 AGATTGTAAC AAGTTCACCA CAGGTGAGAC CATTGCCCAT GGTAAAGTGG 
1301 GCCAGGGCAT CTCGGCCTGG AGCAAGACCT TCTGCGCCCT CTTTGGCCCT 
1351 TGGTTCCGCG CTATTGAGAA GGCTATTCTG GCCCTGCTCC CTCAGGGTGT 
1401 GTTTTACGGT GATGCCTTTG ATGACACCGT CTTCTCGGCG GCTGTGGCCG 
15 1451 CAGCAAAGGC ATCCATGGTG TTTGAGAATG ACTTTTCTGA GTTTGACTCC 

1501 ACCCAGAATA ACTTTTCTCT GGGTCTAGAG TGTGCTATTA TGGAGGAGTG 
1551 TGGGATGCCG CAGTGGCTCA TCCGCCTGTA TCACCTTATA AGGTCTGCGT 
1601 GGATCTTGCA GGGCCCGAAG GAGTCTCTGC GAGGGTTTTG GAAGAAACAC 
1651 TCCGGTGAGC CCGGCACTCT TCTATGGAAT ACTGTCTGGA ATATGGCCGT 
20 1701 TATTACCCAC TGTTATGACT TCCGCGATTT TCAGGTGGCT GCCTTTAAAG 

1751 GTGATGATTC GATAGTGCTT TGCAGTGAGT ATCGTCAGAG TCCAGGAGCT 
1801 GCTGTCCTGA TCGCCGGCTG TGGCTTGAAG TTGAAGGTAG ATTTCCGCCC 
1851 GATCGGTTTG TATGCAGGTG TTGTGGTGGC CCCCGGCCTT GGCGCGCTCC 
1901 CTGATGTTGT GCGCTTCGCC GGCCGGCTTA CCGAGAAGAA TTGGGGCCCT 
25 1951 GGCCCTGAGC GGGCGGAGCA GCTCCGCCTC GCTGTTAGTG ATTTCCTCCG 

2001 CAAGCTCACG AATGTAGCTC AGATGTGTGT GGATGTTGTT TCCCGTGTTT 
2051 ATGGGGTTTC CCCTGGACTC GTTCATAACC TGATTGGCAT GCTACAGGCT 
2101 GTTGCTGATG GCAAGGCACA TTTCACTGAG TCAGTAAAAC CAGTGCTCGA 
2151 CTTGACAAAT TCAATCTTGT GTCGGGTGGA ATGAATAACA TGTCTTTTGC 
30 2201 TGCGCCCATG GGTTCGCGAC CATGCGCCCT CGGCCTATTT TGTTGCTGCT 

2251 CCTCATGTTT TTGCCTATGC TGCCCGCGCC ACCGCCCGGT CAGCCGTCTG 
2301 GCCGCCGTCG TGGGCGGCGC AGCGGCGGTT CCGGCGGTGG TTTCTGGGGT 
2351 GACCGGGTTG ATTCTCAGCC CTTCGCAATC CCCTATATTC ATCCAACCAA 
2401 CCCCTTCGCC CCCGATGTCA CCGCTGCGGC CGGGGCTGGA CCTCGTGTTC 
35 2451 GCCAACCCGC CCGACCACTC GCGTCCGCTT GGCGTGACCA GGCCCAGCGC 

2501 CCCGCCGTTG CCTCACGTCG TAGACCTACC ACAGCTGGGG CCGCGCCGCT 
2551 AACCGCGGTC GCTCCGGCCC CG, 
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or a fourth sequence 

1 GCGGCCGCTC GGTGGGGTTC CACCTCTCTT CCCATTCCGA ACAGAGAAGT 
5 51 TAAGCAAGGA ATTAATTCGC GGCCGCTCGT GTTGCGTGAG GTGGGTATCT 

101 CAGATGCCAT TGTTAATAAT TTCTTCCTTT CGGGTGGCGA GGTTGGTCAC 
151 CAGAGACCAT CGGTCATTCC GCGAGCAACC CTGACCGCAA TGTTGACGTG 
201 CTTGCGGCGT TTCCACCTTC ATGCCAAATA AGCGCCTTCC ATCAGCTTGC 
250 TGAGGAGCTG GGCCACCGGC GGCGCCGGTG CTGTGCTACC TCCCTGCCCT 
10 301 GAGCTTGAGC AGGGCCTTCT CTATCTGCCA CAGGAGCTAG CCTCCTGTGA 

350 CAGTGTTGTG ACATTTGAGC TAACTGACAT TGTGCACTGC CGCATGGCGG 
401 CCCCTAGCCA AAGGAAAGCT GTTTTGTCCA CGCTGGTAGG CCGGTATGGC 
451 AGACGCACAA GGCTTTATGA TGCGGGTCAC ACCGATGTCC GCGCCTCCCT 
501 TGCGCGCTTT ATTCCCACTC TCGGGCGGGT TACTGCCACC ACCTGTGAAC 
15 551 TCTTTGAGCT TGTAGAGGCG ATGGTGGAGA AGGGCCAAGA CGGTTCAGCC 

601 GTCCTCGAGT TGGATTTGTG CAGCCGAGAT GTCTCCCGCA TAACCTTTTT 
651 CCAGAAGGAT TGTAACAAGT TCACGACCGG CGAGACAATT GCGCATGGCA 
701 AAGTCGGTCA GGGTATCTCC GCCTGGAGTA AGACCTTTTG TGCCCTGTTT 
751 GGCCCCTGGT TCCGTTGCGA TTGAGAAGGC TATTCTATCC CTTTTACCAC 
20 801 AAGCTGTGTT CTACGGGGAT GCTTATGACG ACTCAGTATT CTCTGCTGCC 

851 GTGGCTGGCG CCAGCCATGC CATGGTGTTT GAAAATGATT TTTCTGAGTT 
901 TGACTCGACT CAGAATAACT TTTCCCTAGG TCTTGAGTGC GCCATTATGG 
951 AAGAGTGTGG TATGCCCCAG TGGCTTGTCA GGTTGTACCA TGCCGTCCGG 
1001 TCGGCGTGGA TCCTGCAGGC CCCAAAAGAG TCTTTGAGAG GGTTCTGGAA 
25 1051 GAAGCATTCT GGTGAGCCGG GCACGTTGCT CTGGAATACG GTGTGGAACA 

1101 TGGCAATCAT TGCCCATTGC TATGAGTTCC GGGACCTCCA GGOTGCCGCC 
1151 TTCAAGGGCG ACGACTCGGT CGTCCTCTGT AGTGAATACC GCCAGAGCCC 
1201 AGGCGCCGGT TCGCTTATAG CAGGCTGTGG TTTGAAGCCG AAGGCTGACT 
1251 TCCGGCCGAT TGGGCTGTAT GCCGGGGTTG TCGTCGCCCC GGGGCTCGGG 
30 1301 GCCCTACCCG ATGTCGTTCG ATTCGCCGGA CGGCTTTCGG AGAAGAACTG 

1351 GGGGCCTGAT CCGGAGCGGG CAGAGCAGCT CCGCCTCGCC GTGCAGGATT 
1401 TCCTCCGTAG GTTAACGAAT GTGGCCCAGA TTTGTGTTGA GGTGGTGTCT 
1451 AGAGTTTACG GGGTTTCCCC GGGTCTGGTT CATAACCTGA TAGGCATGCT 
1501 CCAGACTATT GGTGATGGTA AGGCGCATTT TACAGAGTCT GTTAAGCCTA 
35 1551 TACTTGACCT TACACACTCA ATTATGCACC GGTCTGAATG AATAACATGT 

1601 GGTTTCCTGC GCCCATGGGT TCGCCACCAT GCGCCCTAGG CCTCTTTTGC 
1651 CGAGCGGCCG C, 
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or a sequence complementary thereto, 

12. The DNA fragment of claim 11, wherein said fragment 
5 contains a coding sequence homologous to nucleotides 2 

through 101 of said first sequence, nucleotides 594 
through 643 of said second sequence, or a sequence 
complementary to said coding sequences. 

10 13 • A kit comprising, in a container or separate containers, 
a pair of single-strand primers derived from non- 
homologous regions of opposite strands of a DNA duplex 
fragment derived from an enterically transmitted viral 
hepatitis agent whose genome contains a region which is 

15 homologous to the 1,33 kb DNA EcoRI insert present in 

plasmid pTZ-KFl(ETl.l) carried in E. coli strain BB4 and 
having ATCC deposit no. 67717. 

14. The kit of claim 13, which are derived from opposite 
20 strands of the Eco RI duplex insert in said plasmid. 

15. A method for detecting the presence of an enterically 
transmitted nonA/nonB hepatitis viral agent in a 
biological sample, comprising 

25 preparing a mixture of duplex DNA 

fragments derived from the sample, 
denaturing the duplex fragments, 
adding to the denatured DNA fragments, a 
pair of single-strand primers derived from non- 
30 homologous regions of opposite strands of a DNA duplex 
fragment derived from an enterically transmitted viral 
hepatitis agent whose genome contains a region which is 
homologous to the 1.33 kb DNA Eco RI insert present in 
plasmid pTZ-KFl(ETl.l) carried in E. coli strain BB4, 
35 and having ATCC deposit no. 67717, 

hybridizing said primers to homologous- 
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sequence region of opposite strands of such duplex DNA 
fragments derived from enterically transmitted nonA/nonB 
hepatitis agent, 

reacting the primed fragment strands with DNA 
5 polymerase in the. presence of DNA nucleotides, to form 

new DNA duplexes containing the primer sequences, and 

repeating said denaturing, adding, hybridizing 
and reacting steps, until a desired degree of 
amplification of sequences is achieved. 



10 
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16. The method of claim 15, wherein the primers are derived 
from opposite strands the EcoRl duplex insert in said 
plasmid. 

17. The method of claim 15, for detecting the presence of 
viral agent in a sample of cultured cells infected with 
the agent. 

18. A vaccine for immunizing an individual against 
enterically transmitted nonA/nonB hepatitis viral agent 
comprising, in a pharmacologically acceptable adjuvant, 
a recombinant protein derived from an enterically 
transmitted nonA/nonB viral hepatitis agent whose genome 
contains a region which is homologous to the 1.33 kb DNA 
EcoRI insert present in plasmid pTZ-KFl(ETl.l) carried 
in E. coli strain BB4, and having ATCC deposit no. 
67717. 

19. The vaccine of claim 18, wherein the protein is derived 
from the EcoRI insert in said plasmid. 

20. in a method of isolating an enterically transmitted non- 
A/nonB viral agent or a nucleic acid fragment produced 
by the agent, an improvement which comprises: 

utilizing, as a source of said agent, bile obtained 
from a human or cynomolgus monkey having an active • 
infection of enterically transmitted non-A/non-B 
hepatitis. 
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21. The method of claim 20, wherein the bile is obtained 
from an infected cynomologus monkey. 

5 22. Human polyclonal anti-serum obtained from a human 

immunized with a protein derived form an enterically 
transmitted non-A/non-B viral hepatitis agent whose 
genome contains a region which is homologous to the 1.33 
kb DNA EcoR l insert present in plasmid pTZ-KFl ( ET1 . 1 ) 

0 carried in E. coli strain BB4 and having ATCC deposit 

no. 67717. 
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